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CONTINUUM SPACE LIMIT OF THE GENEALOGIES OF 
INTERACTING FLEMING-VIOT PROCESSES ON Z 
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Abstract. We study the evolution of genealogies of a population of individuals, whose 
type frequencies result in an interacting Fleming-Viot process on Z. We construct and 
analyze the genealogical structure of the population in this genealogy-valued Fleming- 
Viot process as a marked metric measure space, with each individual carrying its spatial 
location as a mark. We then show that its time evolution converges to that of the 
genealogy of a continuum-sites stepping stone model on R, if space and time are scaled 
diffusively. We construct the genealogies of the continuum-sites stepping stone model 
as functionals of the Brownian web, and furthermore, we show that its evolution solves 
a martingale problem. The generator for the continuum-sites stepping stone model has 
a singular feature: at each time, the resampling of genealogies only affects a set of 
individuals of measure 0. Along the way, we prove some negative correlation inequalities 
for coalescing Brownian motions, as well as extend the theory of marked metric measure 
spaces (developed recently by Depperschmidt, Greven and Pfaffelhuber |DGP12| ') from 
the case of probability measures to measures that are finite on bounded sets. 

AMS 2010 subject classijication: 60K35, 60J65, 60J70, 92D25. 
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1. Introduction and main results 

In the study of spatial population models on discrete geographic spaces (for example 
such as branching processes, voter models, or interacting Fisher-Wright diffusions 
(Fleming-Viot models), the technique of passing to the spatial continuum limit has proven 
to be useful in gaining insight into the qualitative behaviour of these processes. A key 
example is branching random walks on leading to the Dawson-Watanabe process 
|Daw77| on and Fisher-Wright diffusions; catalytic branching and mutually catalytic 
branching on Z, leading to SPDE on M [KS88( IEF96( IDP98( lDEF~*~021)1 [DEF^02a]. The 
goal of this paper is to carry out this program at the level of genealogies, rather than just 
type or mass configurations. We focus here on interacting Fleming-Viot models on Z. 

1.1. Background and Overview. We summarize below the main results of this paper, 
recall some historical background, as well as state some open problems. 

Summary of results. The purpose of this paper is twofold. On the one hand, we want 
to understand the formation of large local one-family clusters in Fleming-Viot populations 
on the geographic space Z^, by taking a space-time continuum limit of the genealogical 
configurations equipped with types. On the other hand we use this example to develop 
the theory of tree-valued dynamics via martingale problems in some new directions. In 
particular, this is the first study of a tree-valued dynamics on an unbounded geograph¬ 
ical space with infinite sampling measure, which requires us to extend both the notion 
of marked metric measure spaces in |GPW09l IDGPll] and the martingale problem for¬ 
mulations in |GPWI3l IDGP12] to marked metric measure spaces with infinite sampling 
measures that are boundedly finite (i.e., finite on bounded sets). 

Here is a summary of our main results: 

(1) We extend the theory of marked metric measure spaces [GPWOh] IDGP12j from 
probability sampling measures to infinite sampling measures that are boundedly 
finite, which serve as the state space of marked genealogies of spatial population 
models. See Section O 

(2) We characterize the evolution of the genealogies of interacting Fleming-Viot (IFV) 
models by well-posed martingale problems on spaces of marked ultrametric measure 
spaces. See Section [L3l 

(3) We give a graphical construction of the spatial continuum limit of the IFV genealogy 
process, which is the genealogy process of the so-called Continuum Sites Stepping- 
stone Model (CSSM), taking values in the space of ultrametric measure spaces 
with spatial marks and an infinite total population. The graphical construction is 
based on the (dual) Brownian web [FINR04] . The CSSM genealogy process has 
the feature that, as soon as t > 0, the process enters a regular subset of the state 
space that is not closed under the topology. Only on this subset we can evaluate 
the action of the operator of the martingale problem in its action on test functions. 
The nice aspect is that the set of these states are preserved under the dynamic. 
This leads to a singular structure with complications for the associated martingale 
problem and for the study of continuity of the process at time 0. See Section [1.41 

(4) We prove that under suitable scaling, the IFV genealogy processes converge to 
the CSSM genealogy process. The proof is based on duality with spatial coales- 
cents, together with a novel approach of controlling the genealogy structure using 
a weaker convergence result on the corresponding measure-valued processes, with 
measures on the geographic and type space (with no genealogies). See Section fL5l 

(5) We show that the CSSM genealogy process solves a martingale problem with a 
singular generator. More precisely, the generator action involves individuals, which 
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are not typical under the sampling measure, so that the dynamic is driven by 
atypical individuals at atypical locations. In particular, the generator is only 
defined on a regular subset of the state space. See Section [L6l 
(6) We prove some negative correlation inequalities for coalescing Brownian motions, 
which are of independent interest. See Appendix ICl 

Besides the description of the genealogies of the current population, we also prepare the 
ground for the treatment of all individuals ever alive, i.e. fossils, moving from the state 
space of marked ultrametric measure spaces to the state space of marked measure M-trees, 
which will be carried out elsewhere. 

History of the problem: Why are we particularly interested in one-dimensional ge¬ 
ographic spaces for our scaling results? Many interacting spatial systems that model 
evolving populations, i.e., Markov processes with state spaces 1^(1 = M,N, [0,1], etc., 
and G = or the hierarchical group that evolve by a migration mechanism be¬ 
tween sites and a stochastic mechanism acting locally at each site, exhibit a dichotomy 
in their longtime behavior. For example, when G = and the migration is induced by 
the simple symmetric random walk: in dimension d <2, one observes convergence to laws 
concentrated on the traps of the dynamic; while in d > 3, nontrivial equilibrium states 
are approached and the extremal invariant measures are spatially homogeneous ergodic 
measures characterized by the intensity of the configuration. Typical examples include 
the voter model, branching random walks, spatial Moran models, or systems of interact¬ 
ing diffusions (e.g., Feller, Fisher-Wright or Anderson diffusions). One obtains universal 
dimension-dependent scaling limits for these models if an additional continuum spatial 
limit is taken, resulting in, for example, super Brownian motion (see Liggett |Lig85| or 
Dawson [Daw Ml). 

In the low-dimension regime, the cases d = 1 and d = 2 are very different. In d = 2, one 
observes for example in the voter model the formation of mono-type clusters on spatial 
scales with a random a G [0,1], a phenomenon called diffusive clustering (see Cox and 
Griffeath [CG86] i. In the one-dimensional voter model, the clusters have an extension of a 
fixed order of magnitude but exhibit random factors in that scale. More precisely, in space- 
time scales {Vi, t) for t —)• oo, we get annihilating Brownian motions. Similar results have 
been obtained for low-dimensional branching systems (Klenke |Kle00| . Winter |Win02] i. 
systems of interacting Fisher-Wright diffusions (Fleischmann and Greven [FG94] . |FG96] 
and subsequently IZho03l . lDEF'*~00l ) and for the Moran model in d = 2 (Greven, Limic, 
Winter |GLWn5j i. 

In all these models, one can go further and study the complete space-time genealogy 
structure of the cluster formation and describe this phenomenon asymptotically by the 
spatial continuum limit. In particular, the description for the one-dimensional voter model 
can be extended to the complete space-time genealogy structure, obtaining as scaling limit 
the Brownian web |Arr79[ [Arr811 [TW981IFINR04] (see AppendixjBl and the recent survey 
[SSSlSj i. More precisely, the Brownian web is defined by considering instantaneously 
coalescing one-dimensional Brownian motions starting from every space-time point in M x 
M. It arises as the diffusive scaling limit of continuous time coalescing simple symmetric 
random walks starting from every space-time point in Z x M, which represent the space- 
time genealogies of the voter model. This is analogous to the study of historical process for 
branching processes, which approximates the ancestral paths of branching random walks 
by that of super Brownian motion (see e.g. [DP911 [FG961 [GLW05| i. 

The basic idea behind all this is that, we can identify the genealogical relationship 
between the individuals of the population living at different times and different locations. 
This raises the question of whether one can obtain a description of the asymptotic behavior 
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of the complete genealogical structure of the process on large space-time scales, which will 
in turn allow for asymptotic descriptions of interesting genealogical statistics that are not 
expressible in a natural way in terms of the configuration process. 

These observations on the genealogical structure goes back to the graphical construction 
of the voter model due to Harris, and continues up to the historical process of Dawson and 
Perkins for branching models |DP91] . or representation by contour processes [G.T98[[Ald93| . 
To better describe genealogies, the notion oiM-trees, marked M-trees or marked measure M- 
trees were developed as a framework [Eva971 [EPWOGI . These objects contain the relevant 
information abstracted from the labeled genealogy tree, where every individual is coded 
with its lifespan and its locations at each time. Such a coding means in particular that 
all members of the population are distinguished, which information is mostly not needed. 
In the large population limit, it suffices to consider the statistics of the population via 
sampling. 

For this purpose, one equips the population with a metric (genealogical distance), a 
probability measure (the so-called sampling measure, which allows to draw typical finite 
samples from the population) together with a mark (specifying types and locations). This 
description in terms of random marked metric measure spaces (in fact, the metric defines 
a tree) is the most natural framework to discuss the asymptotic analysis of population 
models, since it comprises exactly the information one wants to keep for the analysis in 
the limits of populations with even locally infinitely many individuals. The evolution of a 
process with such a state space is described by martingale problems [GPWl.Sl IDGP12] . 

Open problems and conjectures. We show in this paper that the spatial continuum 
limit of the one-dimensional interacting Fleming-Viot genealogy process solves a martin¬ 
gale problem. However, due to the singular nature of the generator, the uniqueness of this 
martingale problem is non-standard, and we leave it for a future paper. In particular, it is 
difficult to establish the duality relation at the level of generators, because the generators 
for the process and its dual are only defined for special test functions at special points. 

Instead of Z, resp. M, as geographic spaces, one could consider the hierarchical group 
Hjv = ©N-Zw, with Z]y being the cyclical group of order N, respectively the continuum 
hierarchical group Zjy. Brownian motion on M can be replaced by suitable Levy 

processes on and the program of this paper can then be carried out. The Brownian 
web would have to be replaced by an object based on Levy processes as studied in |EF96| . 
[DEF+nn] . We conjecture that the analogues of our theorems would hold in this context. 
A further challenge would be to give a unified treatment of these results on M, 

Another direction is to consider the genealogy processes of interacting Feller diffusions, 
catalytic or mutually catalytic diffusions, interacting logistic Feller diffusions, and derive 
their genealogical continuum limits. A more difficult extension would be to include an¬ 
cestral paths as marks, which raises new challenges related to topological properties of the 
state space. Namely, as the space of paths is a Polish space, it is not a Heine-Borel space 
as closed balls around a path are not compact. 

Outline of Section [1]. The remainder of the introduction is organized as follows. In 
Subsection ll.2l we recall and extend the notion of marked metric measure spaces needed to 
describe the genealogies. In Subsection 11.31 we define the interacting Fleming-Viot (IFV) 
genealogy process via a martingale problem and give a dual representation in terms of a 
spatial coalescent. In Subsection 11.41 we give, based on the Brownian web, a graphical 
construction for the continuum-sites stepping stone model (CSSM) on M and its marked 
genealogy process, which is the continuum analogue and scaling limit of the interact¬ 
ing Fleming-Viot genealogy process on Z, under diffusive scaling of space and time and 
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normalizing of measure, a fact which we state in Subsection 11.51 In Subsection 11.61 we 
formulate a martingale problem for the CSSM genealogy process. In Subsection 11.71 we 
outline the rest of the paper. 


1.2. Marked Metric Measure Spaces. In this subsection we introduce the state space 
of the genealogies of interacting Fleming-Viot processes. We want to describe evolving 
genealogies of the whole population of all individuals currently alive allowing to sample 
from this population. We also want to include the individuals’ positions in geographic 
space and possible genetic types. We therefore regard genealogies as (equivalence classes) 
of marked metric measure spaces. As our geographic space is infinite (and not compact), 
it won’t be possible to sample individuals by means of a finite (or after renormalizing 
of a probability) measure. We rather require the sampling measure to be finite on all 
subpopulations which can be obtained by restricting to finite geographic subspaces. 

The following definition of marked metric measure spaces extends the one introduced 
in [DGPllj which considered probability measures only. 

Definition 1.1 (F-mmm-spaces). Let (Vjry) be a complete separable metric space with 
metric ry, and let o be a distinguished point in V. 

1. We call (A, r, /i) a V-marked metric measure space (V-mmm space for short) if: 

(i) {X, r) is a complete separable metric space, 

(ii) n is a measure on the Borel a-algebra of X xV, with fa{X x Bo{R)) < oo for 
each ball Bo{R) C V of finite radius R centered at o. 

2. We say two V-mmm spaces {X,rx, fJ^x) and {Y,rY,faY) are equivalent if there 
exists a measureable map (p : X ^ Y, such that 

( 1 . 1 ) rx{xi,X 2 ) = rY{p{xi),ip{x 2 )) for all xi, 3:2 G supp(/rx(-x 17)), 
and ifip: XxV^YxV is defined by (p{x,v) := {ip{x),v), then 

( 1 . 2 ) p,Y = hx°P~^- 

In other words, ip is an isometry between supp{px{- x 17)) and supp{pY{- x 17)), 
and the induced map <p is mark and measure preserving. We denote the equivalence 
class of {X, r, p) by 


(1.3) 


3. 


(1.4) 


(1.5) 


(A, r,p). 

The space of (equivalence classes of) V-mmm spaces is denoted by 

:= {(A,r,/r) : {X,r,p) is a V-mmm space}. 

The subspace of (equivalence classes of) V-mmm spaces which admit a mark func¬ 
tion is denoted by 




:= {(A, r, p) G : 3 measurable k : A —)• 17 s.t. p{d{x, •)) = p{dx xV)® 

Note that depends both on the set 17 and the metric ry since the latter defines the 
sets on which the measure must be finite. 

Marked metric measure spaces were introduced in |DGP11| , which extends the notion of 
metric measure spaces studied earlier in |GPW09] . Definition 11.11 is exactly the analogue 
of |DGPlll Def. 2.1], where p is a. probability measure. The basic interpretation in our 
context is that: A is the space of individuals; r{xi,X 2 ) measures the genealogical distance 
between two individuals xi and X 2 in A; the measure p is a measure on the individuals 
and the marks they carry (which can be spatial location as well as type, or even ancestral 
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paths up to now), allowing us to draw samples from individuals with marks in a bounded 
set. 

To define a topology on that makes it a Polish space, we will make use of the marked 
Gromov-weak topology introduced in [DGPll) Def. 2.4] for P-mmm spaces equipped with 
probability measures. In this topology a sequence converges iff we can embedd the involved 
metric spaces isometrically into one metric space such that the images of the “sampling” 
measures converges weakly. The basic idea to extend this to is that, given our as¬ 
sumption on p, in Definition 1 1.1 1 1. (ii). we can localize /i to finite balls in V to reduce p, 
to a finite measure. We can then apply the marked Gromov-weak topology (which also 
applies to finite measures instead of probability measures) to require convergence for each 
such localized version of the V-mmm spaces. We will call such a topology V-marked 
Gromov-weak"^ topology, replacing weak by weak"^, following the terminology in [DV.TO.Sl 
Section A2.6] for the convergence of measures that are bounded on bounded subsets of a 
complete separable metric space. Note that vague convergence is for measures that are 
finite on compact rather than bounded subsets. Both notions agree on Heine-Borel spaces 
(compare, |ALW 14| ). 

Definition 1.2 (Wmarked Gromov-weak^ Topology). Fix a sequence of continuous func¬ 
tions ■ V ^ [0,1], A: € N, such that ^ on Bo{k), the ball of radius k centered at o G 
V, and ipk = 0 on B^{k + 1). Let x '■= (A, r, /r) and Xn ■= (A^, rn, iJ-n), n G N, 6 e elements 
of . Let ■ pL he the measure on X xV defined by {ifk ■ A^)(d(x, n)) := ifk{v)fJ-{d{x, v)), 
and letipk'Ln be defined similarly. We say thatXn —X A/ie V-marked Gromov-weak"^ 

n^oo 

topology if and only if: 

(1.6) {Xn,rn,'f’k • Fn) (A, r, • l) A/ie Gromov-weak topology for each k gN. 

n^oo 

When V = we may choose ifk to be infinitely differentiable. 

Remark 1.3 (Dependence on o and (V’A:)fceN)- Note that the P-marked Gromov-weak"^ 
topology does not depend on the choice o G V and the sequence (V’fc)fceN) as long as fik 
has bounded support and Ak := {n : fikiv) = 1} increases to P as A: —)• oo. 

Remark 1.4 (M'^ as a subspace of (Mj)^). Let Mj denote the space of (equivalent 
classes of) P-mmm spaces with finite measures, equipped with the P-marked Gromov- 
weak topology as introduced in |DGPlll Def. 2.4]. Then it is a well-known fact that each 
element (A, r, pi) G can be identified with a sequence ((A, r, ifi - pi), (A, r,'if 2 •//),...) in 
the product space (M^ )^, equipped with the product topology. This identification allows 
us to easily deduce many properties of from properties of that were established 
in |DGPllj . In particular, we can metrize the P-marked Gromov-weak^ topology on 
by introducing a metric (which can be called P -marked Gromov-Prohorov"tl" metric) 

(1-7) ^ _ 

, -r tt; -dMGp((Ai,ri,' 0 fc • ^i), (A 2 ,r 2 ,V’fc • At 2 )) A 1 

«MGP#((Ai,D,m),(A2,r2,//2)) := 2^--, 

k=l 

where dMGP is the marked Gromov-Prohorov metric on , which was introduced in |DGPlll 
Def. 3.1] and metrizes the marked Gromov-weak topology. 

The proof of the following result is in Appendix O 

Theorem 1.5 (Polish space). The space M^, equipped with the V-marked Gromov-weakfi" 
topology, is a Polish space. 
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Points in M^, as well as weak convergence of M'^-valued random variables, can be 
determined by the so-called polynomials on M^, which are defined via sampling a finite 
subset on the P-mmm space. 

Definition 1.6 (Polynomials). Let n G N. Let g G Chh{V"',M), the space of real-valued 
bounded continuous function on with bounded support. For A: G N U {0,oo}, let (f G 

/riN /'n\ 

Cj , M), the space of k-times continuously differentiable real-valued functions on 
with bounded derivatives up to order k. We call the function —>■ M defined by 

(1.8) c^-’'>’5((X^) :=/•••/ mgiv)fa^^idix,v)), 

a monomial of ordern, wherev := {vi,... ,Vn), x := {xi,... ,Xn), r = r(x) := {r{xi,Xj))i<i^j<n 
and /i®"'(d(x, n)) denotes the n-fold product measure of g, on {X x V)"'. 

(a) Let := {$"'’'^’3 ; (f g G Cf,b('P”', M)}, which we call the space 

of monomials of order n (with differentiability of order k). Let IIq be the set of 
constant functions. We then denote by 

:= 

the set of all monomials (with differentiability of order k). 

(b) For P = and k,l G N U {0, oo}, we define 

Uy := ^ e C^(R^fi\R),g G 

and := UngNonn^ 

(c) We call the linear spaces 

(1.9) generated by 

the polynomials (with differentiability of <(>, resp. g, of order k, resp. £). 

Remark 1.7. Note that the polynomials form an algebra of bounded continuous func¬ 
tions since the product of two monomials can be seen as a new monomial as defined in 
n. However, the sum of two monomials in general is not a monomial. 

Throughout the paper we are often interested in the following sub-space of (com¬ 
pare, e.g., Definition 11.341 of so-called regular space of states). Let 6 be a measurable 
function on M_|_, and write 

(1.10) := {(X,r, g) G : g{X x R^(o)) < b{r)}. 

If /3 is a measure on B(V) which is finite on bounded subsets, and 

(1.11) := {{X,r,g) G : g{X x •) = /3}, 

then obviously is a closed subspace of with 6(r) := fi{Br{o)). 

Theorem 1.8 (Convergence determining class). Fix a measurable function (3 : M+ —)• M+. 

We have the following properties for H^, for each A: G N U {0, oo}.' 

(i) is convergence determining in Namely, {Xn,rn, gn) —^ 

if and only if ^{{Xn,rn, gn)) h)) as n ^ oo for all G H^. 

(ii) is also convergence determining in A1i(M(^’-^)), the space of probability mea¬ 
sures on Namely, a sequence of-valued random variables (<l)i)neN 

converges weakly to a -valued random variable X if and only f/E[<h(T’„)] —>■ 

E[‘h(T’)] as n ^ oo for all G H^. 
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(iii) For F = and each k,l G NU {0,oo}, II^’^ is also convergence determining in 
and 

We defer the proof of Theorems li.5l and \1.^ as well as some additional properties of 
V-mmm spaces, to Appendix\M 

Remark 1.9. For the models we consider, the genealogies lie in certain particular 
Polish spaces which arise as closed subspaces of M^. Note that the current population 
alive corresponds to the leaves of a genealogical tree, and the associated P-mmm space 
is ultrametric. We will denote the space of P-marked ultrametric measure spaces by U^. 
They form a closed subspace of and hence is Polish. The same holds for , 

for some Borel measure (5 onV which is finite on bounded subsets. 

Remark 1.10. Recall from Definition ll.il and notice that is not closed, and 
that we therefore choose the bigger space as the state space. The space allows an 
individual x G X to carry a set of marks, equipped with the conditional measure of p. on 
V given x G X. If each individual carries only a single mark which we can identify via a 
mark function k : X ^ V, the corresponding marked metric measure space is an element 
of . This will be the case for the interacting Fleming-Viot process that we will study. 
It can be shown that every element in is an element of the closed space 

u 

A bdd <5>0 

e Mf{X X A) : < /i(. X ^), ||^' - /i(. x R)||tv < hA{S), 

< 5,d{vi,V2) > /ia(5)} = O}, 

for some hA Gki, where 

(1.13) FL := {h : R+ —>• R+ U {oo} : h is continuous and increasing, and h{0) = 0} 
l [KL15l Lemma 2.8]). 

1.3. Interacting Fleming-Viot (IFV) Genealogy Processes. We now study the ge¬ 
nealogies of the measure-valued interacting Fleming-Viot (IFV) processes on a countable 
geographic space and with allelic types, typically taken from the type space [ 0 , 1 ] (see 
[DGV95] for details on IFV), which is motivated by the following individual-based model, 
the so-called Moran model. 

Consider a population of individuals, X, with locations indexed by a countable additive 
group V (for us this later will be Z). The individuals migrate independently according to 
rate one continuous time random walks with transition probability kernel a(-, •), 

(1.14) a(ui,U 2 ) = a(0,U 2 — ui) for all vi,V 2 €V. 

We denote the transition kernel of the time reversed random walks by 

(1.15) d{vi,V2) ■■= a{v2,vi) = a(0, vi - V 2 ). 

Individuals furthermore reproduce by resampling, where every pair of individuals at the 
same site dies together at exponential rate 7 > 0 , and with equal probability, one of the 
two individuals is chosen to give birth to two new individuals at the same site with the 
same type as the parent. This naturally induces a genealogical structure. The genealogical 
distance, r, of two individuals at time t is 2min{t,T} plus the distance of the ancestors 
at time 0, where T is the time it takes to go back to the most recent common ancestor. 
Imposing the Haar measure (here the counting measure), fu, on the population with each 
individual carrying its location as a mark, we obtain a V-mmm space and its equivalence 
class is the state of the genealogy process. 
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Letting now the number of individuals per site tend to infinity and normalizing the 
measure such that each site carries population mass of order one, we obtain a diffusion 
model, the interacting Fleming-Viot (IFV) genealogy process with state space 

(1.16) U])':= 

where n denotes the Haar measure on the countable geographic space V, the 1 indicating 
that the measure restricted to each colony is a probability measure. This (see Remark 
[LSI) is again a Polish space. (For the diffusion limit in the case of a finite geographic space 
P, see |(lPW13j . |D(lP12j L 

Remark 1.11. If we introduce as marks (besides locations from a countable geographic 
space G) also allelic types from some set K (typically taken as a closed subset of [0,1]), 
then the type is inherited at reproduction and P = K x G is the product of type space 
and geographic space. In this case the localization procedure in Definition 11.21 applies to 
the geographic space, since IK is compact. 

1.3.1. The genealogical IFV martingale problem. We now define the interacting Fleming- 
Viot (IFV) genealogy process via a martingale problem for a linear operator iF^ on 
(^^(U])", M), acting on polynomials. For simplicity we first leave out allelic types, which we 
introduce later in Remark 1 1.1 31 

The linear operator on (^^(Uj)^,M), with domain 11^’*^ as introduced in Defini¬ 
tion II.6l fbi with d = 1, consists of three terms, corresponding respectively to aging, 
migration, and reproduction by resampling. With X = {X,r,iJ,) and = <1)’^’'^’^ £ 11^’*^, 


L^^^>(T’) = 2 


(1.17) 


{XxVV 

/ 

(Xxvy 


T^^id{x,v))g{v) 

l<k<i<n 


d 


drky 


Hl) 


T^’"(d(x,v)) (f){g) - g){v) 

j=l 


+ 27 / 7 t®"(d(x,u)) c/(u) ^ \vk=vi}{Gk,e(>- 


{XxVY' 


l<k<e<n 


where x = (xi, ■ ■ ■ ,Xn), v = (ui, • • • ,Vn), L ■= {rk,£}i<k<£<n ■= irixk,xe)i<k<e<n), and 

(1.18) {M^.yg){vi,--- ,Vn) := g{vi,--- ,Vj-i,v',Vj+i, ■ ■ ■ ,Vn) 

encodes the replacement by migration of the j-th sampled individual corresponding to a 
jump from location Vj to v', while 9^/ encodes the replacement of the Lth individual by 
the A:-th individual (both at the same site). More precisely, 

' r(xi,xj) ifiJFF 

(1.19) {9k,£()>){£)■= 4>{^k,eg), with {9k,£g)i,j := < r{xi,Xk) if j = L 

r{xk, Xj) i = 1. 

The first result states that there is a unique Uj^-valued diffusion process associated with 
this operator. 


Theorem 1.12 (Martingale problem characterization of IVF Genealogy processes). For 
any Xq = (Xo,ro, fto) € U]^, we have: 

(i) The {L^^,11^’^, 5xo)-martingale problem is well-posed, i.e. there exists a -valued 
process X^^ := {X^^)t>o, unique in its distribution, which has initial condition 
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Xq and cddldg path, such that for all $ E 11^’^ and w.r.t. the natural filtration 
generated by 


t 

(1.20) — J is a martingale. 

0 

(ii) The solutions (for varying initial conditions) define a strong Markov with contin¬ 
uous path. This Markov process has the Feller property, i.e., the one-dimensional 
distributions depend continuously on the initial distribution. 

(iii) If the initial state admits a mark function, then so does the path almost surely. 

Remark 1.13. If we add the type of an individual as an additional mark, i.e., V := 
G X IK with geographic and type space respectively, then the same result holds if we modify 
as follows. We require the states to satisfy the constraint that the projection of p. on the 
geographic space V is still the counting measure. The test functions <I> should be modified 
so that we multiply g : G'^ M, acting on the locations of the n sampled individnals, by 
another bounded, continuous factor gtyp : K” —M, acting on the types of the individuals. 
The generator should be modified accordingly, so that gtyp changes at resampling 
from gtyp to ^typ ° dk,i, with 6^/ replacing the Gth sampled individual by the k-th. one, see 
[D(IP12p . 

Remark 1.14. The process xr = (V, ,rt,p,t) has the property that the measure¬ 
valued process T) given by the collection {nt{Xt x {z} x ■),i G G}, is the IFV process on 
(Ali(K))®. 

Remark 1.15. From Section 11.41 onward, we will choose V = Z. However in the 
subsequent analysis, it is important to observe that we can embed Z into M and view 
as a closed subspace of U®, and view the IFV genealogy process as U®-valued process. 

1.3.2. Duality. The IFV genealogy process can be characterized by a dual process, 
the spatial coalescent. The formulation given here can also incorporate mutation and 
selection (see |D(IP12p . 

For each n E N, let Sn denote the space of partitions of the set {1,..., n}, i.e., tt = {p; p E 
tt} E §n is a collection of disjoint subsets p C {1, ...,n}, referred to as partition elements 
(or blocks), such that tt)pg,rP = Moreover, let denote the space of marked 

partitions of the set {1, ...,n} with mark space V, i.e., we regard {(p,i;p); tt = {p; p E tt} E 
Sn, Up E V} E as a partition of {1,..., n} where each partition element is assigned a mark 

in V. Finally, let Sn := S))" x denote the space of historical marked partitions of the 
set {1, ...,n} with mark space V. That is, we regard ({(p,z;p); p E vr}, (rij)i<i<j<n) E Sn 
as consisting of a marked partition and a matrix of mutual distances. 

The dnal process {fCt)t>o will take values in S := UneN^« with the following dynamics: 
given a finite historical marked partition, independently every pair of partition elements 
with the same mark in V merges at rate 7 . Until a pair of partition elements merges, the 
marks migrate independently of each other on V according to a continuous time random 
walk with transition kernel d. After merging the marks of the two involved partition 
elements will move together forever. At time t, we define the genealogical distance rt{i,j) 
of i and j in {l,2,...,n} as 2min{t,Tjj}, where Tij is the hrst time that i and j belong 
to the same partition element. 
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For each n, 4> G 
X S„ ^ M with 


^( 2 ) 


and g G Cbb(y'^,'K), we define a duality function H : 

(1.21) H{X,K) -.= [ ^g/.vpe^} •g(il’')</'(i:’"(x) + r:'), 

J(XxV)\'^\ 


where df = {X,r,g) € U](", and /C = ({(p,^p); p G vr},£') G §„, and -if = (Pp(i))j=i,...,n, 
and r^{x) := (r(xp(i), Xp(j)))i<i<j<n, with p(f) being the partition element of tt containing 

i G ( 1 , 


Remark 1.16. Note that {H{■,)€) : /C G Sn,<^ G <7^°°R), 5 G Cbb{V^, n G N} 
is law-determining and convergence-determining on U]^. 

The IFV genealogy process is dual to the coalescent {JCt)t>o, and its law and 

behavior as t ^ 00 can be determined as follows. 


Theorem 1.17 (Duality and longtime behaviour). The following properties hold for the 
IFV genealogy process {X^^)t>o: 

(a) For every G Uj(" and ICq G S, we have 

(1.22) E[H{Xr,)Co)] = E[H{Xi^,)Ct)], t > 0. 

(b) Ifa{-, •) = ^(a(-, •) -|- a{-, •)) is recurrent, then 

(1.23) £[TfV]^F G Ali(Un, 

t —^00 

where F is the unique invariant measure of the process on 

Remark 1.18. If a is transient, then we can decompose = {Xt,rt, gt) in such 

a way that X^ is the countable union of disjoint sets XI, pit is the sum of measures g\, 
and {{Xl,rt\xixxi^ are V-mmm spaces such that for x G Xl,x' G X^ with i ^ j, 

rt{x,x') diverges in probability as t ^ 00 , and each {Xl,rt\xixxii T\) converges in law to 
a limiting V-mmm space. Alternatively we can transform distances r : r ^ 1 — e“'’ and 
obtain a unique equilibrium in that case. See also |GM] . 

Remark 1.19 (Strong duality). As the interacting Fleming-Viot process is a population 
model, its evolving genealogy {Ut,rt, gt)t>o can be represented by a V-marked R-tree 
(A, r). That is, we can find a 0-hyperbolic metric space (with distances 00 possible equal 
to) (A, r) such that any two x, y G A of finite distance are connected by a path, and 
isometries {(pt)t>o with (pt : Ut ^ X such that for all x G A there is a t > 0 with 
(/j)r^(x) G supp(/it(- X V)). We won’t write down this R-tree representation explicitly 
here but it can be derived from the look-down construction presented in [GLWn5] in a 
straightforward way. 

We would like to point out that this representation by an R-tree allows to define the 
ancestor Aj of x at time T back at time T — s, and the above duality relation can actually 
be stated in a strong sense. That is, we can construct for each T > 0 our model together 
with a dual process on the same probability space such that for all X' C Ut with 

f*x'\ 

#A' <oo,^G C^°°(R; " GR), and g G Cbb{V*^ ,R), and H = from (fmi) . for 

all s G [0, T], 

(1.24) H(Ut-s,^J) = H(Ut,Xq), almost surely. 

Notice that whenever a duality relation holds almost surely rather than in expectation, 
one refers to it as a strong form of duality. 
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Indeed, define for fixed T > 0 and any finite subset X' := {xi, C Ut the map 

:= := (({(p,?P; P G -^J},r/))se[o,T] which sends := [Ut := {Ut,rt, fit))t>o to 

a path with values in the space of historical marked partitions defined by 

(1.25) = r{(pTixi),(pTixj)) A (T- s), 

for all 1 < i < j < #X', a marked partition of { 1 , 2 ,n} defined through the equivalence 
relation 

(1.26) i=Jj iff r'/{i,j) < {T - s), 

and a family of position functions on (/?'r(supp(/r'r(‘ x I^))) such that 

(1-27) e:’^(p) :=icr-s(v^?i,(p)). 

By construction, is the dual spatial coalescent. Moreover, for all s G [0,T], and 
{x,v) G {Xt X I/)l^ol, 

(1.28) (^)) + ’ 

and skt-s{A'^(x)) has under ht the same distribution as types under hts- This together 
implies the strong form of duality as stated in (11.241) . 


As a further consequence of relation (jl. 22 p we can specify the finite dimensional distri¬ 
butions of completely in terms of the spatial coalescent as follows. Fix a time 

horizon T > 0. The finite dimensional distributions are determined by the expectation of 
all test functions : (^([O, oo), M^) of the following form: 

( 1 - 29 ) ^{{Xr)t>o) := ), 

for some i & N, 0 < ti < t 2 < ■■■ < t^ = t and G of order Uk G N, 

(\ 

(t)k G and gu G CfefeCF^SM), for each k = 

The dual is the spatial coalescent with frozen particles (/Cs)sg[o,T] > for which the time 
index s G [0,r] runs in the opposite direction from the time index of . Namely, 
looking backward from time T, for each 1 < A: < .£, we start Uk particles at time T — tk 
at locations := (Ct\,i) ^ 4 ,rn.) G V"'^, each forming its own partition element in the 

partition vr G n := ni + ... + n£. The particles perform the usual dynamics of the spatial 

coalescent with the restriction that all particles starting at time T — tk were kept frozen 
before time T — tk- At time s, the genealogical distance rs{i,j) between two individuals i 
and j, started respectively at times T — ti and T — tj, is defined to be 

(1.30) 2min{s,Tij} — min{s, (T — ti)} — min{s, (T — tj)}, 
where Tij is the first time the two individuals coalesce. 

Denote this new spatial coalescent process with frozen particles by {Xt)t>o- The state 
space § is once more the space of historical marked partitions. We then define the duality 
function H : U]^ x S ^ M which determines the finite dimensional distributions of X for 
varying K, as in (| 1 . 21 l) but now with 

I i 

(1.31) f'i'L) •= J_ ((A,j )niH - \-nk-i<i<j<ni-\ - \-nk) Snd^f^U ) . J_ S'!; (fk i H *^)) 

k=l k=l 

where vr^ are the partition elements started at time tk- 

The following space-time duality is an immediate consequence of the Markov property 
and the duality applied successively to the time intervals ■■■i [OjD]- 
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Corollary 1.20 (Space-time duality). Let G VY , and JCq £ E> be as in (1? ) with 
tk I^T for all 1 < k < £. Let $ be defined as above /il.30\) . Then 

(1.32) =E[H{A:Y^,JCt)]. 

In words, the genealogical distances among the n = ni individuals sampled 

from (d4^^)se[o,T]) with individuals sampled at time tk at specified locations, can be 
recovered by letting these individuals evolve backward in time as a spatial coalescent until 
time 0, at which point we sample from according to the location of each partition 

element in the spatial coalescent. 

1.4. Genealogies of Continuum-sites Stepping Stone Model (CSSM) on R. If we 

rescale space and time diffusively, the measure-valued interacting Fleming-Viot process on 
Z converges to a continuum space limit, the so-called continuum-sites stepping stone model 
(CSSM) . Formally, CSSM is a measure-valued process u := {nt)t>o on R x [0,1], where R 
is the geographical space and [0,1] is the type space. We might think of individuals in the 
population which undergo independent Brownian motions, and whenever two individuals 
meet, one of the two individuals is chosen with equal probability and switches its type to 
that of the second individual. Provided that uo(- x [0,1]) is the Lebesgue measure on R, 
CSSM was rigorously constructed in [EF96( IEva97( lDEF+nn( iZhoOS] via a moment duality 
with coalescing Brownian motions. In particular, nfi- x [0,1]) is the Lebesgue measure on 
R, for all t > 0. 

In this subsection we will construct the evolving genealogies of the CSSM based on dual¬ 
ity to the (dual) Brownian web, and establish properties (Proposition [OSl Theorem ll.27h . 

For the voter model on Z, the joint genealogy lines of individuals at all space-time 
points in Z x [0, oo) are distributed as a collection of coalescing random walks evolving 
backward in time (see |Lig85| ). Analogously, for the CSSM on R, when n individuals are 
sampled from the population at possibly different times, their joint genealogy lines evolve 
backward in time as coalescing Brownian motions. Upon reaching time 0, each surviving 
genealogy line then independently selects an ancestral type by sampling according to the 
conditional distribution of uq on the type space [0,1], conditioned on the spatial location of 
the genealogy line. Furthermore the joint genealogy lines of individuals at all space-time 
points in R X [0, oo) are distributed as a collection of coalescing Brownian motions evolving 
backward in time. Therefore the CSSM on R is exactly the continuum analogue of the 
interacting Fleming-Viot process (as well as the voter model) on Z. 

Although having an uncountable number (starting from every space-time point in 
R X [0, oo)) of coalescing Brownian motions seems problematic, this object has been con¬ 
structed rigorously and is now known as the (dual) Brownian web W [FINROdl IFINR06] . 
dual to a forward Brownian web W constructed on the same probability space. The (dual) 
Brownian web is essentially a collection of coalescing Brownian motions on R, starting from 
every point in the space-time plane R x R. 

In [FINR04] . the Brownian web W is constructed as a random variable where each 
realization of 

(1.33) W is a closed subset of 11 := U<jgRC([s, oo),R), 

the space of continuous paths in R with any starting time s G R. In |FINR04] . the topology 
is defined by first compactifying the space R^ suitably, and then by choosing for 11 the 
topology of local uniform convergence and requiring the initial times to be close if paths 
are close. For each ^ := (x, t) G R, we will let yV{z) := W{x, t) denote the subset of paths 
in W with starting position x and starting time t. We can construct W by first fixing a 
countable dense subset P C R^, and then construct a collection of coalescing Brownian 
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motions {VV’(z) : z E P}, with one Brownian motion starting from each z ^ T). The 
Brownian web W is then obtained by taking a suitable closure of {VV(z) : z E P} in 11, 
which gives rise to a set of paths starting from every point in the space-time plane It 
can be shown that the law of W does not depend on the choice of T> (see Theorem B.l in 
Appendix [B|) . 

The Brownian web W has a graphical dual called the dual Brownian web, which we 
denote by W. Formally, 

(1.34) W is a random closed subset of 11 := UseRC'((—oo, s], M), 

the space of continuous paths in M starting at any time s E M and running backward in 
time as coalescing Brownian motions. 

The joint distribution of W and W is uniquely determined by the requirement that the 
paths of W never cross paths of W (see, Theorem B.3 in Appendix [B]). Thus, jointly, the 
Brownian web and its dual is a random variable taking values in a Polish space, with 

(1.35) (>v,>v)Enxn. 

Interpreting coalescing Brownian motions in the (dual) Brownian web as ancestral lines 
specifying the genealogies, we can then give an almost sure graphical construction of the 
CSSM, instead of relying on moment duality relations as in [EF961 [Eva97l iDEF+OOi I^o03| . 
which nevertheless we get as corollary of the graphical construction. The classical measure¬ 
valued CSSM process can be recovered from by projecting the sampling measure 

{h?^)t>o to the mark space V, if V is chosen to be the product of geographical space M 
and type space [0,1]. In what follows, we will take V to be only the geographical space 
M, since types have no influence on the evolution of genealogies. 

We next explicitly construct a version of the CSSM genealogy process 

(1.36) := (AP)i>o, = (xp,rfS,/iP), t>0, 

as a functional of the (dual) Brownian web (VV’,W). 

To avoid a disruption of the flow of presentation, background details on the (dual) 
Brownian web we will need are collected in Appendix iBl 

We proceed in three steps: 

Step 1 (Initial states). Assume that A^® belongs to the following closed subspace of 
U®: 

(1.37) U® := {(A, r,/i) E U® : /i(A x •) is the Lebesgue measure on M}. 

In other words, U® is the set of M-marked ultrametric measure spaces where the projection 
of the measure on the mark space (geographic space) M is the Lebesgue measure. This is 
necessary for the duality between CSSM and coalescing Brownian motions. We will see 
that almost surely E Uf for alH > 0. 

Step 2 (The time-t genealogy as a metric space). To define for every 

t>0, let us fix a realization of {yV,yV), (see the Appendix [B] for more details). For each 
t > 0, let 

(1.38) At := {n E M : W{v,t) contains a single path /(^,^t)}, Et := M\At. 

By Lemma lB.41 on the classification of points in w.r.t. W and W, almost surely, Et 
is a countable set for each t > 0. For each v G At, we interpret f{v,t) the genealogy 
line of the individual at the space-time coordinate {v,t). Genealogy lines of individuals 
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at different space-time coordinates evolve backward in time and coalesce with each other. 
At time 0, each genealogy line traces back to exactly one spatial location in the set 

(1.39) E := {/%,,) (0) :vGR,s>0J GW}, 

where we note that E is almost surely a countable set, because by Theorem IB. II and 
Lemma [8.21 paths in W can be approximated in a strong sense by a countable subset of 
paths in W. For each v G E,we then identify a common ancestor ^(u) for all the individuals 
whose genealogy lines trace back to spatial location v at time 0, by sampling an individual 
^(u) G according to the conditional distribution of /Iq® on conditioned on the 

spatial mark in the product space X^® x M being equal to v. 

We next characterize individuals by points in space. Note that there is a natural ge¬ 
nealogical distance between points in At. For individuals x,y G At, if f(x,t) and f(y^t) 

coalesce at time f < t, then denoting u := f(x,t){^) and v := f(y^t)i^)i define the 
distance between x and y by 

2{t — f) if f > 0, 

2t + r^^{i{u),i{v)) ifr<0 . 

First define (Xp®,rp®) as the closure of At w.r.t. the metric rt defined in (|1.40p . Note 
that is ultrametric, and by construction Polish. 

Remark 1.21. We may even extend the distance rt to a distance r between {x,t) with 
X G At and t > 0, and (y, s) with y G Ag and s > 0. More precisely, let 

s -\-t — 2 t if r > 0, 

s + t + r*^^{i{u),i{v)) iff<0 . 

Step 3 (Adding the sampling measure). We now define := ,rf^ , For 

that purpose, we will represent next as an enriched copy of M (see (ll.42p below). 

By identifying each x G At with the path f(^x,t) € W, we can also identify X^^® with the 

closure of {f(^x,t) G W : x G At} in 11, because a sequence Xn G is a Cauchy sequence 

w.r.t. the metric rt if and only if the sequence of paths f(xn,t) is a Cauchy sequence when the 
distance between two paths is measured by the time to coalescence, which by Lemma iB. 2 1 
is also equivalent to (/(a;„,t))neN being a Cauchy sequence in LI. 

When we take the closure of {f(x,t) G W : x G At} in LI, only a countable number of 
paths in W are added, which are precisely the leftmost and rightmost paths in W{x,t), 
when W{x,t) contains more than one path. 

Namely for each x G Et (recall from (|1. 381) 1. let denote the two copies of x 

obtained by taking limits of Xn G At with either Xn i x or Xn \ x in M, and let Ef- := 
{x^ : x G Et}. We can then take 

(1.42) Xf^ -.= AtiJ Et yj Et , 

equipped with a metric rp®, which is the extension of rt from At to its closure Xp^, giving 
a Polish space (X/"®,r)"®). 

Next to get a sampling measure, note that each finite ball in {Xp^,rt^) with radius 
less than t can be identified with an interval in M (modulo a subset of Et U Et U E }~), and 
hence can be assigned the Lebesgue measure of this interval. That is, we define the Borel 
measure on (Xp^, rp^) by 

(1.43) tp^({x' G Xp^ : rp^(x',x) < 5}) := R{x,6) — L{x,6), 6 <t,x G Xp^ 


(1.41) 


r{ix,t), {y,s)) : = 


(1.40) 


rt(x,y) := 
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if {x' G : rp^(x',x) < (5} = {L{x,6), R{x,6)) C M. We then define the sampling 
measure on B{{Xp^,r^^) x (M, dguci)) by 

(1.44) fj,f^{dxdv):='jlf^{dx)dxidv). 

This completes our construction of the CSSM genealogy process {Xp^)t>o. 

Remark 1.22 (Notational simplification). In the sequel we will apply the existence of a 
mark function and the embedding of the basic set in the enriched reals to simplify notation. 
Namely, we will write integrals w.r.t. the sampling measure over X x V as an integral 
w.r.t. the Lebesgue measure overM. 

A key feature is again duality. We can replace in the Definition of the process {lCt)t>o 
from Subsubsection 11.3.21 the random walks by Brownian motions to obtain (/C®'^)t>o, 
which is dual to the process {Xf^)t>o by construction: 

Corollary 1.23. (H-duality) The tree-valued CS-process and marked tree-valued coalesc¬ 
ing Brownian motions are in H-duality, i.e., for each H of the form il.21\) . 

(1.45) 

Furthermore by the above construction strong duality holds. □ 

Remark 1.24. Note in the continuum case the function a®’’) is not a poly¬ 

nomial since we fix the locations we consider. In order to get a polynomial we need to 
consider a function g on mark space with g G M) over which we integrate w.r.t. the 

sampling measure. 

We collect below some basic properties for the CSSM genealogy processes that we just 
constructed. 

Proposition 1.25 (Regularity of states). Let := be the CSSM genealogy 

process constructed from the dual Brownian web W, with G U®. Then almost surely, 
for every t > 0: 

(a) There exists a continuous (mark) function Kt : X^^ M, i.e., //p®(dxdu) = 
/rP(dx X M)5«.,(^)(di;); 

(b) For each i G {0,t), X^^ is the disjoint union of balls {B()i^z of radius i. Fur¬ 
thermore, there exists a locally finite set Ef := {vi}i^z C M with Uj_i < vi for all 
z G Z, such that {k{x) : x G Rf} = [vi-i,Vi] and fif^{Bf) = Vi — Uj_i. We can 
identify Ef from W by 

(1.46) El := {x G Rt : and f^^ ^^ coalesce at some time s <t — i}, 

where and are respectively the rightmost and leftmost path in W{x,t); 

(c) = {Xp^ ,rD^, G U®, i.e., pLf^{Xp^ x •) is the Lebesgue measure on E. 

Remark 1.26. Using the duality between the Brownian web W and W, as characterized 
in Appendix IHl it is easily seen that we can also write 

(1.47) eI = {f{t) : f G W{x,t — s) for some x G E, s > /}. 

Theorem 1.27 (Markov property, path continuity, Feller property). Let be as in 
Proposition \1.25[ Then 

(a) {Xp^)t>o is a -valued Markov process; 

(b) Almost surely, Xp^ is continuous in t > 0; 









CONTINUUM SPACE LIMIT OF IFV GENEALOGIES 


17 


(c) For each m G N, let be a CSSM genealogy process with G U®. 

If ^CS 

in U®, then for any sequence tm ^ t > 0, we have weak 
convergence in law, i.e. ^ , as m ^ oo. 

(d) For each initial state in U®, converges weakly to the unique equilibrium 

distribution on U® as t —)• oo. 

Remark 1.28. Notice that this equilibrium state can be represented in terms of a func¬ 
tional of the Brownian web. 

Remark 1.29. We note that if we allow types as well, we enlarge the mark space 
from M to M X [0,1], where each individual carries a type in [0,1] that is inherited upon 
resampling. Theorem II.271 still holds in this case. We will consider such an extended mark 
space in Theorem 11.321 below. 

Remark 1.30. Proposition 11.251 shows that even though can be any state in U®, 
for t > 0, can only take on a small subset of the state space U®. This introduces 
complications in establishing the continuity of the process at t = 0, and it will also be an 
important point when we discuss the generator of the associated martingale problem. 

1.5. Convergence of Rescaled IFV Genealogies. In this subsection, we establish 
the convergence of the interacting Fleming-Viot genealogy processes on Z to that of the 
CSSM, where we view the states as elements of U® (see Remark I1.15h . We assume that 
the transition probability kernel a(-, •) in the definition of the IFV process satisfies 

(1.48) ^^a(0,x)x = 0 and cr^ a(0, x)rr^ = a(0, G (0, oo). 

ojGZ icGZ icGZ 


For each e > 0, we then define a scaling map = Sf : U® —)• U® (depending on a) as 
follows. Let X = {X,r,fi) G U®. Then 

(1.49) SeX := (X,S,r,5,/x), 

where {Ser){x,y) := e^r{x,y) for all x,y £ X, and is the measure on V x M induced 
by p and the map (x, u) G X x M ^ (x, €a~^v), and then the mass rescaled by a factor of 
ecr“^. More precisely, 

(1.50) {Sep){F) = ea~^p{{x,e~^av) : {x,v) G F} for all measurable F C X x M. 


We have the following convergence result for rescaled IFV genealogy processes. 
Theorem 1.31 (Convergence of Rescaled IFV Genealogies). Let := {xf^’^)t 


H )t>o 

he an IFV genealogy process on Z with initial condition G Uf, indexed by e > 0. 

XV,e . T^CS vCS 


Assume that a(-,-) satisfies ^1.48^ , and S^Xq 
T hen (F^X^^Yf)t>o converges as C([0, oo), 
CSSM genealogy process X^ 


^CS , 


A Xg ® for some X^* G U“ as e ^ 0. 
\-valued random variable weakly to the 

0 . 


To prove Theorem 11.311 we will need an auxiliary result of interest in its own on the 
convergence of rescaled measure-valued IFV processes to the measure-valued CSSM. The 
IFV process with mark space M x [0,1] is a measure-valued process {X^^)t>o, where X/'^ 
is a measure on M x [0,1], and its projection on M is the counting measure on Z. Similarly, 
the CSSM with mark space M x [0,1] is a measure-valued process {Xp^)t>o, where X/^® 
is a measure on M x [0,1], and its projection on M is the Lebesgue measure on R. Define 
X^^ and X^^ respectively as the projection of the measure component of X^^ and X'^®, 
projected onto the mark space R x [0,1]. We then have 
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Theorem 1.32 (Convergence of Rescaled IFV Processes). Let := he a 

measure-valued IFV process on Z, indexed by e > 0. Assume that a(-,-) satisfies 
and converges to w.r.t. the vague topology for some as e ^ 0. Then 

(SeK - 2 ^)t>o converges as (^([O, oo), A1(M x [0,1]))-valued random variable weakly to the 
CSSM process := {X['^)t>o as 0. 

A similar convergence result has previously been established for the voter model in |AS11| . 

Remark 1.33. As can be seen from the above convergence results and the regularity 
properties of the limit process in Proposition 11.251 on a macroscopic scale, there are only 
locally finitely many individuals with descendants surviving for a macroscopic time of 5 
or more. This phenomenon leads in the continuum limit to a dynamic driven by a thin 
subset of hotspots only. For similar effects in other population models, see for example 
|DEF+n2b[rDEF+n2a] . 

1.6. Martingale Problem for CSSM Genealogy Processes. In this section, we show 
that the CSSM genealogy processes solves a martingale problem with a singular generator. 
To identify the generator for the CSSM genealogy process {Xf^)tyQ, we note that for 
all t > 0, Xp^ satishes the regularity properties established in Proposition 11.251 We will 
see that is only well-defined on G 11^’^ evaluated at points X G U® with suitable 
regularity properties for and X. 

We now formalize the subset of regular points in U® as follows, which satisfies exactly 
the properties in Proposition 11.251 

Definition 1.34 (Regular class of states U®). Let U® denote the set of X = (A, r, /x) G U® 
which satisfies the following regularity properties: 

(a) X G U®, i.e., /u(A x •) is the Lebesgue measure on R; 

(b) there exists a mark function it : A —>■ R, with fi{dxdv) = /x(dx x R)5^(2,)(du); 

(c) there exists S > 0 such that for each I G (0,5), A is the disjoint union of balls 

{Bl)i^z of radius 1. Furthermore, there exists a locally finite set := C R 

with Vi-i < Vi for all z G Z, such that {k{x) : x G B\} = and fJ,{Bl) = 

Vi - Vi_i. 

By Proposition 11.251 U® is closed under the dynamics of (i-e., G U® implies 
that Xf^ G U® for all t > 0), separable, metric measurable subset of the Polish space U®. 
However, it is not complete. Note that the hrst requirement gives rise to a closed set, the 
second requirement is known to generate a measurable set [KL15| . and it is not hard to 
see that the third requirement also generates a measurable set. 

Remark 1.35. Similar to the discussion leading to (ll.42p . for A G U®, we can give a 
representation on an enriched copy of R as follows. Property (c) in Dehnition 11.341 implies 
that any two disjoint balls in A are mapped by k to two intervals, which overlap at at 
most a single point in for some I > 0. Therefore k~^{x) must contain a single point for 
all X G R with x not in 

(1.51) E:=lJiyoE\ 

and K~^{x) containing two or more points implies that x is in k{Bi) H k{B2) for two 
disjoint balls in A. By the same reasoning, for each x G E, k“^(x) must contain exactly 
two points, which we denote by x^, where x~^ is a limit point of {^“^(tc) : w > x} and 
x~ is a limit point of {/z“^(t(;) : w < x}. Similar to (11.421) . we can then identify A 
with (R\E) U E~^ U E~ , where E^ := {x^ : x G E}. With this identification and with 
(ll.44p . we can simplify our notation (with a slight abuse) and let p, be the measure on 
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(R\E) U E~^ U E , which assigns no mass to E^ and is equal to the Lebesgue measure on 
R\E. 

We now introduce a regular subset of 11^’^ needed to dehne the generator 

Definition 1.36 (Regular class of functions Let IlJ’^ denote the set of regular test 

functions defined as in il.8\) . with the property that: 

(1.52) 3(5>0 s.t.'^i ^ j £ {1,2,-■■ ,n}, -^^{{rk,i)i<k<i<n) = 0 VrjjG[0,5]. 

Orij 

We can now specify the action of the generator on regular ih evaluated at regular 
points, namely L^{X) exists for G and X = {X,r,ia) G U®. By Remark 

11.351 we can identify X with (R\E) U E~^ U E~, while p, is identified with the Lebesgue 
measure on M.\E. The generator is given by 


(1.53) := L^®$(T’) + lP$(<T) + lP$(T’), 

with the component for the massflow (migration) of the population on R given by 

(1.54) L^®$(T’) := i / (t){r)Ag{x)dx, 

^ Jx^ 

and the component for aging of individuals given by 

d(j) 


(1.55) 


lPc^(T’) := 2 [ g{x) 


l<2<j'<n 


dr, 


-(r) dx. 




These operators are linear operators on the space of bounded continuous functions, Cb(U® 
with domain 11^’^, and maps polynomials to polynomials of the same order. 

The component for resampling is, with 9k,if> defined as in (I1.19p . given by 

(1.56) lP$(T’) ■■= f ^Mxk)=^{xi)} 9{x) {0k,i(t)-4>){'^ p*{dxk)tJ*{‘^xi) JJ drc*, 

with effective resampling measure and mark functions 


l<i<n 

i^kjl 


(1.57) + 

x^E x^E 

(1.58) k{x^) = x for X £ E, 
where E, and x^ for x £ E, are defined as in Remark 11.351 

Remark 1.37. Note that Lp® is singular. First because the effective resampling measure 
pi* is supported on a countable subset of X and is singular w.r.t. the sampling measure 
p. on. X. Secondly, p* is locally infinite because E n (a, b) contains infinitely many points 
for any a < b. Therefore the r.h.s. of (11.5611 is now in principle a sum of countably many 
monomials of order n — 2. 

Indeed, as we partition X = (R\i7) U E~^ U E~ into balls of radius /, with I f 0, the 
balls must correspond to smaller and smaller intervals on R so as not to contradict the 
fact that each point in X is assigned one value in R. Nevertheless, Lp®<h(lL) in (11.560 is 
well-defined at least on U® because by our assumption that ‘h G Ilr’^ and condition (ll.52p 
on (j), we have 6k,i4> = 4> d the resampling is carried out between two individuals x^ and 
x~ for some x £ E, with r{x~^,x~) < 6. Thus only resampling involving x £ E with 
r{x^,x~) > 5 remains in the integration w.r.t. p*, and such x are contained in the locally 
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finite set introduced in Definition 11.841 (c). Together with the assumption that g has 
bounded support, this implies that the integral in (|1.56l) is finite. 

Remark 1.38. The operator defined on functions in IlJ’^ evaluated at regular 
states in U® C U®, is still a linear operator, mapping polynomials to generalized poly- 
nomials of degree reduced by two and with domain Hr’ . Here, generalized polynomial 
means that they are no longer bounded and continuous, and are only measurable functions 
defined on the subset of points U® C U®. Hence LJ?® differs significantly from generators 
associated with Feller semigroups on Polish spaces. 

For Lp®<h(T’) to be well-defined for any 4> G H^’^, instead of <h € Hr’^, we need to place 
further regularity assumption on the point X at which we evaluate $. These assumptions 
are satisfied by typical realizations of the CSSM at a fixed time, as we shall see below. 

Definition 1.39 (Regular subclass of states U®). Let U® be the set of X = [X, r, g) G U® 
with the further property that 

(1.59) ^ r{x^,x ) < oo for all n G N, 

x£En[—n,n] 

where we have identified X with {M.\E) U E^ U E~ as in Remark \1. 3 5\ . 

Theorem 1.40 (Martingale problem for CSSM genealogy processes). 

(i) IfXo G U^, then the , 5 Xq)-' martingale problem has a solution, i.e., there 

exists a process X := {Xt)t>o with initial state Xq, with almost surely continuous 
sample path and with Xt G Uf for every t > 0, such that for all $ G H^’^, and 
w.r.t. the natural filtration, 

(1.60) (4>(Tt) - 4>(To) - J ( 1 ^CS^)(^^)^^^ is a martingale. 

(ii) The CSSM genealogy process constructed in Sec. \1.4\ is a solution to the above 
martingale problem. Apart from the properties established in Prop. \1.25[ for each 
t > 0, almost surely X[^^ G U® . 

Remark 1.41. Whether almost surely, Xp^ G U® for all t > 0, remains open. 

Remark 1.42. Note that different from the usual martingale or local martingale prob¬ 
lems, as for example in |EK86| . the test functions here are only defined on a (topologically 
not closed, only dynamically closed) subset of the state space. 

We conjecture that the martingale problems above are in fact well-posed. A proof could 
be attempted by using the duality between the CSSM genealogy process and the Brownian 
web. There are however subtle technical complications due to the fact that the generator 
of the martingale problem is highly singular. We leave this for a future paper. 

1.7. Outline. We provide here an outline of the rest of the paper. In Section [2] we prove 
the results on the IFV genealogy processes. In Section 3 we construct the CSSM genealogy 
process, and establish in Section 0] the convergence of the IFV genealogies to those of 
the CSSM, and in Section [5] results on the martingale problem for the CSSM genealogy 
processes. In Appendix we collect further facts and proofs concerning marked metric 
measure spaces. In Appendix |Bl we recall the construction of the Brownian web and its 
dual, and collect some basic properties of the Brownian web and coalescing Brownian 
motions. In Appendix O we prove some results on coalescing Brownian motions needed 
in our estimates to derive the martingale problem for the CSSM. 
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2. Proof of Theorems 11.121 and 11.171 

In this section we present the proof of the results on the evolving genealogies for the 
interacting Fleming-Viot diffusions. This model is a special case of evolving genealogies 
for the interacting A-Fleming-Viot diffusions which are studied in |GKWj . 


Proof of Theorem \1.12[ We will proceed in five steps: (1) We show the result on the 
martingale problem and the duality for finite geographic spaces V. (2) To prepare for the 
general case where V is countable, we define an approximation procedure with specific 
finite geographic space dynamics. (3) We then show the convergence in path space, as the 
finite spaces approach V. (4) We verify the claimed properties of the solution for general 
P by a direct argument based on the duality and an explicit look-down construction. (5) 
Finally, we show that the process admits a mark function. 

We will use several known facts on measure-valued Fleming-Viot diffusions. For that 
we refer to |Daw93l Chapter 5] for the non-spatial case and to [DGV95] for the spatial 
case. 

Step 1 {y finite) The case where V is finite is very similar to the non-spatial case. We 
therefore just have to modify the arguments of the proof of Theorem 1 in [GPW13] or 
Theorem 1 in [DGP12| . 

As usual we will conclude uniqueness of the solution of the martingale problem from a 
duality relation. Note that in contrast to the (non-spatial) interacting Fleming-Viot model 
with mutation considered in |DGP12| . in our spatial interacting Fleming-Viot model, 
resampling takes place only locally, that is, for individuals at the same site. The tree¬ 
valued dual will therefore be based on the spatial coalescent considered in [GLWn5| . As 
the calculations to verify the duality relation are the same as in |GPW13] and in |DGP12| , 
we omit them here. 

As for existence of a solution of the martingale problem we consider the martingale 
problems for the evolving genealogies of the approximating spatial Moran models. By 
consistency of the spatial coalescent, we get the uniform convergence of generators for 
free. Thus we only have to show the compact containment condition. Here we can rely on 
the general criterion for population dynamics given in Proposition 2.22 in [GPW13] . As 
V is finite, all arguments given in [GPW13] to verify this criterion simply go through here 
as well. 

Step 2 (A coupled family of approximating finite systems) Let now V be count¬ 
able, and consider a sequence (In)neN of finite sets with 14 V V, and 14 t Put for 
each n G N, and for all vi,V 2 G V, 


( 2 . 1 ) 


a-n ( vi , V2 ) 


a(vi,V2)lv„xVn('>’l,'>’2) 

<5(ui,U2), 


if Vi G 14 , 

if Vi 0 14. 


Denote then by a solution of the martingale problem associated with the operator 

(restricted to I 4 ) 

d 


^FV,U„^ 


T^''{d{x,v))g{v) f, - 

JiXxVP - 


( 2 . 2 ) 


-F / - g){v) 

J{XxVp 


+ 27 


'{Xxvy 


T^'^id{x,v)) g{v) ^ l{vk=ve}{0k,i4>- 


l<k<£<n 
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and the spatial coalescent on with migration rate a„(-, •) rather than a(-, •). 

Notice that a„ is not necessarily double stochastic anymore, which turns the duality 
with the spatial coalescent into a Feynman-Kac duality where the Feynman-Kac term 
converges to 1, as n —oo, on every finite time horizon. The Feynman-Kac duality reads 
as follows (compare, for example, [Seil4[ Proposition 3.11]): for all Tq € U]]^, 

(2.3) E[//(T™,/Cj^")] =E[i/(To,/C^)exp( f A{JCY-)ds)], 

Jo 

where 

(2-4) ^((tt,^',!)) “ !)• 

iGTT v'GV 

which is bounded along the path by |7r|- Const, for all t > 0. 

Establishing the Feynman-Kac duality requires to check that (compare. Section 4.4 in 
[EK861 Theorem 4.4.11]): 

(2.5) + A{K.) ■ 

This can be immediately verified by explicit calculation (compare, [GPW13( Section 4] 
for the generator calculation for the resampling part, and [Seil4( Proposition 3.11] for the 
generator calculation for Markov chains - here migration - whose transition matrix is not 
double stochastic). 

As for given n G N, our dynamics consists of independent components outside Vn, we 
can apply Step 1, here with the Eeynman-Kac duality, to conclude the well-posedness of 
the martingale problem with respect to 

Step 3 {y countable) Eix Xq G Uj(". In this step we want to show that the family 

{;t-FVy„. 

n G N} is tight, and that every limit satisfies the (L^^, 11^’^,(5A^(,)-martingale 

problem. 

Observe first that the family of laws of the projection of the measures on mark space 
are tight since the localized state w.r.t. a fixed finite subset A of K has only finitely many 
marks and weight jA] (uniformly in n). We therefore will here ignore the marks and show 
tightness in Gromov-weak^-topology. For that we want to apply [EK861 Corollary 4.5.2]. 
Since an{vi,V 2 ) a{vi,V 2 ) for all vi,V 2 G V, we clearly have that converges uni¬ 
formly to as Vn t 14, i.e., sup_:^^guv ^{X) \ —)• 0, for depending 

n—>-oo 

only on finitely many sites, it remains to verify the compact containment condition, i.e., 
to show that for every T > 0, and e > 0 we can find a compact set Kt^e V such that 
for all n G N, 

(2.6) G Kt,e; for all t G [0, T]) > 1 - e. 

For that purpose we will once more rely on the criterion for the compact containment 
condition which was developed in [GPW131 Proposition 2.22] for population dynamics. 
To see first that the criterion applies, notice that the evolving genealogies of interacting 
Fleming-Viot diffusions can be read off as a functional of the look-down construction given 
in |GLW05] . Thus the countable representation of the look-down defines a population 
dynamics. In particular, for each t > 0, we can read off a representative of 

xr in the look-down graph such that ancestor-descendant relationship is well-defined. 
Denote for all t > 0, s G [0,t] and x G W by At{x, s) G W-s the ancestor of x G W back 
at time s, and for P Xg hy Dt{J,s) C Xt the set of descendants of a point in J at 
time t. 

In the following we refer for each finite A C K to 

(2.7) ;t^FV,.,A 
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as the restriction of to marks in A, i.e., obtained by considering the sampling 

measure ^^{dxdv) := lAfJ-{dxdv). Fix T > 0. We then have two show that the following 
properties are true for all A C 1/, 

• Tightness of number of ancestors. For all t G [0,T] and s G (0,t), the family 

{S 2 ^{Xt,rt, Ht)] n G N} is tight, where Ht) denotes the minimal number 

of balls of radius 2e needed to cover Xt up to a set of ^t-measure e. 

• Bad sets can be controlled. For all e G (0,T), there exists a 6 = 6{s) > 0 

such that for all s G [0,T), n G N and u G [0, s])-measurable random 

subsets C Xs X A with 

(2.8) limsupP( sup ,s) x ^) > e) < e. 

nSN le[s,T] 

(i) Fix t G [0,T]. W.l.o.g. we assume that is a subgroup of V with addition +„ 
for each n G N. We consider for each n G N another spatial coalescent on Vn with 
migration kernel dn{',‘) rather than an{-,-) where 

(2.9) dn{v,v') := 

where denotes equivalence modulo Further, denote by the evolving ge¬ 

nealogies of the interacting Fleming-Viot diffusions whose migration kernel is dn{', •) rather 
than an{‘,’)- We prefer to work with K^’^. For this spatial coalescent it was verified in the 
proof of Proposition 3.4 in [GLW05| that for any time t > 0 the total number of partition 
elements of which are located in A is stochastically bounded uniformly in n G N. As 
the kernel dn{’, •) is double stochastic, and are dual (without a Feynman-Kac 

potential), and as as n ^ oo, the claim follows. 

(ii) Fix T > 0, A C F finite, e G (0,T), s G [0,T), n G N and a a{Xu^’^'^-, u G 
[0, s])-measurable random subset C Xg x A. From the generator characterization of 
;t^FV,I4 

, we can conclude that the process , s)); t > s} is a F-indexed system 

of interacting (measure-valued) Fisher-Wright diffusions. We have to find 5 = (5(e) such 
that (12.81) holds if < <5. 

Notice that {/Uj(F>]^" , s) x A); t > s} is a semi-martingale and given by a martingale 

with continuous paths due to resampling plus a deterministic flow in and out of the set A 
due to migration. Therefore we have to control the fluctuation of the martingale part and 
the maximal flow out of the set A over a time interval of length t — s. 

The martingale part is estimated from below with Doob’s maximum inequality (the 
quadratic variation is bounded uniformly in the state and in n by a constant -lAI, details 
are left to the reader). The deterministic out flow occurs at most at a finite rate c x |A| 
since the total mass of every site is one. This estimate is uniform in the parameter n 
(recall the random walk kernel is perturbed by restricting it to Vn)- Similarly the flow 
into A can be bounded by c • |A| independently of n but the flow out of the set A occurs 
with a maximal rate c independently of n with n > no (A), and 

(2.10) c = 2 -max| max a(v',v), max a(u',i;)|. 

•u'eA'=,-!;eA ^ 


Step 4 (Feller property). We next prove that X^^ has the Feller property. From here 
it is standard to conclude that X^^ satisfies the strong Markov property (see, for exam¬ 
ple, [EK861 Theorem 4.2.7]). Consider a sequence (Tq”^ )ngH in such that —)■ Xq, 


n —f-oo 
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(n) 

Gromov-weak'^ly, for some Xq € Uj^. Denote by and the evolving ge- 

(n) 

nealogies of the interacting Fleming-Viot diffusions started in Xq ’ and Xq^ respectively, 
and let K, be our tree-valued dual spatial coalescent, and H as in (ll.2ip . Then for each 
given t > 0, 

( 2 . 11 ) K[H[xi,"\K.t)\K.t]^^[H{Xo,K.t)\K.t], a.s. 

n —^oo 

Thus, by our duality relation, 


( 2 . 12 ) 


(n) 

,ICo)]=E[H{X^^\]Ct)] 

-^E[H{Xo,ICt)] =E[F(T’™°,/Co)]. 


Recall from Remark R:gs6 that the family /C); n G N, G G S„} is 

FV FV X 

convergence determining. Thus it follows that T) ’ ° T) ’ for all t > 0. 

n—¥oo 

Step 5 (Mark function) Fix T > 0, and Xq G 

For the proof we will rely once more on the approximation of the solution of the 
, 6^0 )-martingale problem by Uf()^-valued evolving genealogies of Moran mod¬ 
els, X^'P, where p > 0 is the local intensity of individuals. By the look-down construction 
given in [GLW05| . we can define the family {X^'P; p > 0} on one and the same probability 
space. Moreover, as the solution of the (L^^,n -martingale problem has continu¬ 
ous paths, due to Skorohod representation theorem we may assume that as 

p —>■ oo, uniformly for all t G [0,T], almost surely. 

We will relv on Theorem 3.9 in [KL15j. That is. as the solutions of the II^’ 

martingale problem have continuous paths almost surely, we have to construct for each 
hnite A CV, for each t G [0,T], and e,5,p > 0 a function ht^e,A £ 'H (recall from (11.13^ 1 
and a random measurable set ^ a — such that 

(2.13) hmsuplimsupP{pt(X^ \ Vt G [0,T]} > 1 - e. 

(54,0 p^oo ’ ’ ’ 


Assume hrst that the geographic space V is hnite. For the construction of such a 
function GTi and a random measurable set Y^^y = Y^^ C , we can proceed 

exactly as in the proof of Theorem 4.3 of [KL15] where the statement is shown with 
mutation rather than migration in the non-spatial rather than the hnite geographic space. 


Let now V be countable, and consider a sequence {Vn)neN of hnite sets with 14 C F and 
Vn t F. Gonsider for each n G N a solution, of the 11^’°,(l;to)-niartingale 

problem with as dehned in ()2.2p . As we have seen above, G for all 

t > 0, almost surely. Moreover, we have shown in Step 3 that each solution X^^ of 
the (L^^, n^’^, (5;t^(j)-martingale problem on F can be obtained as the limit of as 

n — >■ oo. To conclude from here that also Xf^ G for all t > 0, almost surely, hx a 
hnite set A C F. As done before we denote by x (• n A)))j>o 

and X^^A = [Xt,rt,l^t{’ x (• x A)))t>o the restrictions of X^^An A^^, respectively, 
to marks in A (compare ()2.7ll h 

For each m > n we couple X^^An ^FV,Um through the graphical lookdown con¬ 
struction by using the same Poisson point processes and marking every path which leaves 
Vn in the Fn dynamics by a 1. Moreover, we impose the rule that the 1 is inherited upon 
lookdown in the sense that both new particles carry type 1. The sampling measure of 
types then follows an interacting Fleming-Viot (in fact two-type Fisher-Wright) diffusion 
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with selection. The corresponding Moran models are coupled and converge in the many 
particle per site limit to a limit evolution, which is the coupling on the finite geographic 
spaces and the additional types act upon resampling as under selection. 

By construction, if x, x' G supp(^t(- x T)), their distance is the same in and 

ffFVTm both carry type 0. Thus for suitably large n (depending on e > 0) such that 
A CVn at any location in A the relative frequencies of types 1 at time t can be made less 
than any given e > 0 with probability > 1 — e by simple random walk estimates. Namely, 
if (Z^) t>o is a o(-, •)-random walk starting in 6 G 14 C Vm and b' G CVn, 

(2.14) F{Z^ ^ Vn for some t G [0, T],Z^e A) 

+F{Z^ G Vn for some t G [0, T], Z^ G T) < —)• 0 as n —>■ oo V m > n. 

Then the expected frequency of type 1 in locations in A is bounded by F{5m) with F{5) 

0 as 5 —7> 0, which follows from the properties of the Fisher-Wright diffusion with selection 
easily via duality. 

As a consequence the supremum along the path of the difference in variational norm of 
the distance-mark distributions for the 14, and the 14i-evolution for types in the set A can 
be bounded by a sequence converging to 0 as n, m ^ oo. 

Therefore also the limit dynamics on countable V has a mark function. I 


Proof of Theorem | i. iTj Let be the evolving genealogies of interacting Fleming-Viot 
diffusions where we have assumed that the symmetrized migration is recurrent. In order 
to prove ergodicity we proceed in two steps: (1) We start with constructing the limiting 
object which is tree-valued spatial coalescent. (2) We then prove convergence of to this 
tree-valued spatial coalescent, at t — >■ oo, for any initial state Xq G U]^. This immediately 
implies uniqueness of the invariant distribution. 

Step 1 (Tree-valued spatial coalescent) Recall from |GLWr)5] the spatial coalescent 
started with infinitely many partition elements per site and with migration mechanism 
d{v,v'). If the symmetrized migration is recurrent, we can assign to each realization a 
marked ultrametric space, JC = {K,r), which admits a mark function. In order to equip it 
with a locally finite measure on the leaves, we consider a coupled family of sub-coalescents 
{JC^, /9 > 0} such that the number of points of a given mark is Poisson with intensity p. If 
we now assign each point in mass p~^, then it follows from [GLWOSl Theorem 3] that 
there exist a measure p on K xV such that for each u G P, p{K x {u}) = 1. This reflects 
the spatial dust-free property. Thus, we can use the same arguments used in |GPW09[ 
Theorem 4] to show that the family {(/C^, p~^ v)eKPxV and in fact has 

exactly one limit point, 

(2.15) JC^ := {K,rt,pi) 


Step 2 (Convergence into the tree-valued spatial coalescent) For all Xq G and 

/Co G S, by our duality relation, using the functions H = from (jl.2ip . 


E[R-(TfV,/Co)] =E[iL(To,/Ci)] 

t—^oo 

= E[H{}C\}Co)]. 


(2.16) 













26 


ANDREAS GREVENi, RONGFENG SUN 2, ANITA WINTER ^ 


Once more, as the family /C); n € N, (/) G Cbb(^+ ),}C € S} is convergence 

determining by Theorem ll-Sl ii , we can conclude that for all initial conditions, 
converges Gromov-#-weakly to the tree-valued spatial coalescent. I 


3. Proofs of the properties of CSSM Genealogy Processes 

In this section, we prove Proposition 11.251 and Theorem 11.271 bv using properties of the 
double Brownian web (W, W), which was used to construct the GSSM genealogy process 
in Section [H 

Proof of Prop. 11.251 (a) : The existence of a mark function k : —)• M follows by 

construction (recall (jl.44p where the mark is explicitly given). The continuity of k follows 
from the property of the dual Brownian web W. More precisely, if ^ x in 
then identifying Xn and x with points on M, it follows that n for some 

path ^ kV(xn,t) and f(^x:,t) ^ which implies that ^ x in M. 

(b) : By (ll.42p . we identify Xp^ with M, where a countable subset is duplicated. The 

distance between x,y £ Xp^ is defined to be twice of the time to coalescence between the 
dual Brownian web paths f(^x,t) ^ W(x,t) and f(y^t) ^ if the two paths coalesce 

above time 0. Therefore for I G (0,t), each ball B\ of radius I correspond to a maximal 
interval [uj_i,Ui] C M, where all paths in {f(^x,t) ^ W(x,t) : x G (uj_i,Ui)} coalesce into 
a single path by time t — 1. The collections of such maximal intervals (ui_i,Uj) form a 
partition of where = {vi : i G Z} is exactly the set defined in (11.4611 . 

(c) : Fix an I G (0,t). By construction, for each ball BI of radius I, k{B\) = [ui_i,Uj] is 

assigned the Lebesgue measure on M. Together with (b), it implies that yp^{Xp^ x •) is 
the Lebesgue measure on M. I 


Proof of Theorem ll.271 Let us hx a realization of the Brownian web W and its dual W, 
and let be constructed from W as just before Proposition 11.251 By ()1.42p . for each 
t > 0, we can identify Xp^ with At U Ep U E^, where At and Et are defined as in (|1.38l) . 
To simplify notation, we will drop the superscript GS in the remainder of the proof. 

(a) : The Markov property of {A!t)t>o follows from the Markov property of W and W. 

More precisely, if we denote by /|* the restriction of a path / G 11 to the time interval 
[s, t], and K\l := {f\l : f G K} for a set of paths iL C 11, then >V|o is independent of Wl^ 
for each s > 0. The same is true for W since VV|q and VV|q a.s. uniquely determine each 
other by Theorem IB.31 The Markov process {A!t)t>o is time homogeneous because VV|o is 
equally distributed with apart from a time shift. 

(b) : Let Xq G U®. We first prove that {Xt)t>o is a.s. continuous in t > 0. To accomplish 
this, since 11^’^ is convergence determining in U® as shown in Theorem 11.81 it suffices to 
show that for any := ^"-’^’9 £ 11^’^, the evaluated polynomial 

(3.1) $(Tt) = f (j){n)g{x)dx 

JR^ — 

is continuous in t, where x := (xi,..., Xn), r* := {rt{xi, Xj))i<icj<n, and given the identifi¬ 
cation between Xt and At U Ep U E^ and the definition of k, we have replaced integration 
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w.r.t. fj,t on Xt X M by integration w.r.t. the Lebesgure measure on M. By (ll.4n[l . for 
Lebesgue a.e. Xi,Xj G M, we have 


(3.2) 


rt{xi,Xj) = 


2{t - f) 

2t + ro{^{u),^{v)) 


if f > 0, 
if f < 0, 


where f is the time of coalescence between f(^x,t) and /(y,t), = u, and /(x,t)(0) = v. 

By Lemma IB.41 for each t > 0, W{x,t) contains a single path for all but a countable 
number of x G M. For such x, by Lemma lB.2l the time of coalescence between fi^x,s) and 
tends to t as s —>■ L and hence lirng^^ r((x, s), (x, t)) = 0, where r((x, s), (x, t)) is 
defined in (|1.41l) and extends the definition of rt{x,y) to individuals at different times. 
Since 


(3.3) 


\rs{xi,Xj) -rt{xi,Xj)\ < r{{xi, s), {xi,t)) + r{{xj,s),{xj,t)), 


it follows that when t > 0, for Lebesgue a.e. Xj, Xj G M, 1 < f j < n, we have 


(3.4) 


limrs(xj,Xj) =rt{xi,Xj). 


S — 


We can then apply the dominated convergence theorem in (j3.ip to deduce that, for each 
t > 0, a.s. 


(3.5) 


lim4>(Al,) = $(Ali). 

s^t 


This verifies that is a.s. continuous in t > 0. 

Proving the a.s. continuity of (T’t)t>o at t = 0 poses new difficulties because Xq can be 
any state in U®, while for any t > 0, Tt is a regular state as shown in Proposition 11.251 
We get around this by showing that X admits a cadlag version. More precisely, we invoke 
a part of the proof of the convergence Theorem 11.311 that is independent of the current 
proof. Note that for any Xq G U®, we can find a sequence G Uf , indexed by e > 0, 

such that S^Xq ’ Xq. Indeed, we only need to approximate the mark space M by eZ 
in order to construct S^Xq from Xq. In the proof of Theorem 11.311 it is shown that 
the corresponding sequence of interacting Fleming-Viot genealogy process (5'eT’^^^’'^)t>o 
is a tight family of Z1([0, oo),U®)-valued random variables, where Z1([0, oo),U®) denotes 
the space of cadlag paths on U® equipped with the Skorohod topology. Furthermore, 
(5.<- 2 ^ )t>o converges in finite-dimensional distribution to the CSSM genealogy process 
{Xt)t>o- Therefore, {Xt)t>o must admit a version which is a.s. cadlag, with T) — Xq as 
11 0. Since we have just shown that the version of {Xt)t>o constructed in Sec. 11.41 is a.s. 
continuous in t > 0, it follows that the same version must also be a.s. cadlag, and hence 
continuous at t = 0, which concludes the proof of part (b). 


(c): To prove the Feller property, let X^ 

,(m) 


(m) 


Xq in U®, and let tm —^ i > 0. To show 


X^^"’ => T’t, by Theorem 11.81 it suffices to show 

(3.6) lim E[4'(T’/™^)] = E[4'(T’t)] V 4> = G 

m—^oo 

We claim that the convergence in 

(3.7) hmE[<h(T’s)] = E[‘I>(T))] is uniform w.r.t. the initial condition Xq. 

s—>-t 

In particular, as m —>■ oo, 
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To prove (j3.6[l . it then suffices to show that 

(3.8) lim =E[^>(Tt)]. 

m^oo 

We prove (13.7p and (13.8p next. 

Proof of (j3.7l) . By the Markov property of X, it suffices to show that as t 0, 

(3.9) |E[$(Tt)] - E[$(To)] I ^ 0 uniformly in Tq. 

Note that we can write 

(3.10) E[d>(T’o)] = / g{x)K[(p{{ro{f,{xi),C{xj))}i<i^j<n)]dx, 

where for each x G M, ^(x) G Xq is sampled according to the conditional distribution of 
/iQ on Xq, conditioned on the spatial coordinate in Xq x M being equal to x. On the other 
hand, 

(3.11) E[d>(T))] = / g{x)¥.[4>{{rt{xi,xj)}i<icj<n)]dx. 

Let F{x, t) denote the event that the dual Brownian web paths f(xi,t)^ • • •; f{xn,t) do not 
coalesce during the time interval [0, t]. We can then partition E[(/)({rt(xj, Xj)}i<j<j<„)] 
into expectation restricted to F{x,t) and F^{x,t) respectively. On the event F{x,t), we 
can replace f(xi,t)j ^ < i < n, by independent Brownian motions {xi{s))s<t, ^ < i < n, 
starting respectively at Xj at time t and running backward in time. Then 

(3.12) rt{xi,Xj) = 2t+ ro(|(xi(0)),^(xj(0))). 

Let F{x,t) denote the event that {xi{s))s<t, I < i < n, do not intersect during the time 
interval [0,t]. Then using (13.121) . we can rewrite (|3.11l) as 

(3.13) 

E[d>(T’t)] = [ g{x)E[4>{n)lF‘^(x,t)]dx+ [ 5(x)E[(/>({2t + ro(C(xi(0)),^(xj(0)))}i<i<j<n)] dx 

JR" — JR" 

- [ fi'(^)E[(?(>({2f+ ro(C(2:i(0)),C(a;j(0)))}i<i<j<n)ljj;,. ^Jdx. 

■Jr" ’ 

Note that for Lebesgue a.e. xi, ... ,Xn, P(F'^(x, t)) = E(F'^(x, t)) — 0 as t | 0. Therefore 
by the bounded convergence theorem, the first and third term on the right hand side of 
(I3.13[) converges to 0 as t Oj uniformly in Xq. For the second term on the right hand side 
of ()3.13p . we can make the change of variable yi := Xj(0), yi(s) := Xj(s), to rewrite it as 

(3.14) [ E[g{yi{t),..., yn{t)) </>({2f + ro(C(2/i), C(2/j))}i<i<i<n)] dy. 

JR" 

For each y G M”", clearly the quantity inside the expectation converges a.s. to the analogue 
in (I3.1UD as t i 0, and the speed of convergence does not depend on Xq. Therefore the 
expectation in (I3.14h also converges, uniformly in Xq. Using the fact that g has bounded 
support, while g and (p are both bounded, we can easily dominate the integrand in (13.1411 
w.r.t. dy by an integrable function as t 0; (13.9h . and hence (13.7h follows. 

Proof of (j3.8jl . For each Xi G M, 1 < z < n, let us denote f{xi,t) £ VV(xi,t) by /*. Then 

(3.15) E[$(t/™^)] = f g{x)E[(l){rl^^)]dx, 

Jr" — 

where (xj, Xj)}i<j<j<n depends on the realization of {fi)i<i<n- Let r be the 

smallest time in [0, t] when a coalescence occurs among the paths {fi)i<i<n- Let Ii,..., /^ 
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denote the partition of {1,..., n}, where all fi with i in the same partition element Ij have 
coalesced into a single path at time f. Conditioned on (/j)i<i<n on the time interval [f, t], 
the distribution of the remaining k coalescing Brownian motions on the time interval [0, f] 
is then given by the distribution of k Brownian motions conditioned not to intersect on 
the time interval [0, f], and their positions at time 0 has a probability density in which 
we denote by Note that conditioned on (/i)i<i<n on the time interval 

[f,t] and their positions vi,...,Vk at time 0, (j){{rl'^\xi,Xj)}i<i<^j<n) only depends on 
^(^i))}i<j<i<A:) of. (|3.2p . We can therefore write 

:= E[(/)(^M) I ifiis)),<i<n,se[r,t]] 

(3.16) r . . 

= / 9f ■ ■ ■ ,Vk)H4>f ^{ro {C{vi),^{vj))}i<i<j<k)]dv, 

where given the realization of (/j(s))i<j<n,se[f,t] and {fii^)}i<i<n = • • ■,Vk}, 

(3.17) <^/>({?’o”'^(?(^^i),C(^'i))}i<i<j<fc) = 4>{{ri'^\xi,Xj)}i<i<j<n). 

Note that . is a polynomial of order k on U®, defined from the bounded continuous 
functions . and 4>fr-: except that g^ ~ does not have bounded support. Nevertheless, 
gj: . is integrable and can be approximated by continuous functions with bounded support. 

Therefore from the assumption Xq in U®, we deduce that -(T’q™'^) —)■ .(Tq) 

for Lebesgue a.e. xi ,... ,Xn and a.e. realization of (/i('S))i<j<„^se[•?,*]■ H then follows from 
the bounded convergence theorem that 

(3.18) E[<h(T'/™))] = / < 7 (x)E[cf> -(T^o^™))]dx ^ [ g{x)E[<^> f .{Xo)]dx = EMXt)], 

which concludes the proof of the Feller property. 

(d): This follows readily from the construction of the CSSM genealogy process Xp^. 
For any > 0, the genealogical distances among individuals with spatial locations in 
[—N, N] are determined by coalescing Brownian motions in the dual Brownian web W. 
The initial condition affects the genealogical distances only on the event that the 
backward coalescing Brownian motions starting from [—iV, iV] at time t do not coalesce 
into a single path by time 0. As t —>■ oo, the probability of this event tends to 0, and 
therefore {Xpp.)s>o converges in distribution to the CSSM genealogy process constructed 
from W as in Section oi with initial condition being at time — oo. ■ 


4. Proof of convergence of Rescaled IFV Genealogies 

In this section, we prove Theorem 11.311 that under diffusive scaling of space and time 
as well as rescaling of measure, the genealogies of the interacting Fleming-Viot process 
converges to those of a CSSM. In Section [4.11 we prove f.d.d.-convergence, and in Section 
14.21 tightness in path space. In Section [4.31 we prove Theorem 11.321 on the measure-valued 
process, which is needed to prove tightness in Section [421 

As in Theorem 11.311 let {Xp^''^)t>o = (vf''^’'^, be the family of IFV 

genealogy processes on Z indexed by e > 0, such that € U® as e ^ 0, and 

let (ACS) t>o = be the CSSM genealogy process with initial condition 

vCS 

^0 ■ 
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4.1. Convergence of Finite-dimensional Distributions. In this subsection we prove 
the convergence {SeX^y 2 ^)t'>o ^ in finite-dimensional distribution, i.e., 

(4.1) ,..., ^ V 0 < ti < t2 < • • • < 4, 

where ^ denotes weak convergence of (U®)*^-valued random variables. By |EK 86 l Prop. 3.4.6] 
on convergence determining class for product spaces, it suffices to show that for any 
:= g n^’^, 1 < i < k, we have (recall (ll.Sp i 

k k 

(4.2) 

i=l i=l 

For notational convenience we assume first that the initial tree is the trivial one (all 
distances are zero) and we shall see at the end of the argument that this easily generalizes. 
We will first rewrite both sides of this convergence relation in terms of the dual coalescents 
and then apply the invariance principle for coalescing random walks. 



Step 1 (Claim rephrased in terms of coalescents). We can by the definition of 
the polynomial in (13.ip rewrite the left hand side of (14.21) as 


E 


K 


e 


(4.3) 


2 = 1 
k 


)n( 


= £1 I gi{e^ 

*=1 x{,...,x’^GZ 


- 1 / 




PV 6 

where for each time ti, we sample rii individuals in W^- 2 ). at respective spatial positions 
ixl)i<a<ni, with 

(4.4) itiixa) being sampled from according to fyY 2 ^.{-\xa), 

PV € PV c 

the conditional distribution of u _ 2 , on X _ 2 , conditioned on the spatial mark being equal 
to Xa G E. 

By the space-time duality relation ()1.32l) for the IFV genealogy processes, every sum¬ 
mand of the R.H.S. of ()4.3p can be calculated in terms of coalescing random walks. Namely, 
the joint law of the space-time genealogies of the sampled individuals (x^) G 
1 < a < Uj and 1 < i < /c, is equal to that of a collection of coalescing random walks 
{Xa{y)s<e-^ti (recall here time s runs backwards), starting respectively at at time 
where each walk evolves backward in time as rate 1 continuous time random walk on Z 
with transition probability kernel a, and two walks at the same location coalesce at rate 
7 . From the duality relation we get the following the stochastic representation: 


(4.5) 


„FV,e 


ix,y) 


2{s — t) if r > 0 , 

2 s + r^^’Y^oMyo(x)) ifr< 0 . 


where f denotes the time of coalescence between the two coalescing walks starting at 
X, y G Z at time s, while u, v are the positions of the two walks at time 0. 

We observe next that the continuum population is represented by At U Ef' U which 
is a version of M marked on F* by , the geographic marks are the reals and since the 
sampling measure is Lebesgue measure, we can write polynomials based on integration 
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over M instead of X x V as: 

(4.6) j j ^^®'^{d{x,u))g{u)<f{r{x)) = j X^'^{du)g{u)(p{r{u)). 

XV R 

We can rewrite using the duality of Corollary 11.231 see also (ll.4ip . the R.H.S. of (14.21) 
in the same form as in (j4.3|) : 

k k „ 

(4.7) =E[n / ff*((ya)i<a<nJ</>i((rf(i/i,yD)i<a<6<n.)dy*], 

i=l i=l 

where at each time ti, we sample Ui individuals from X^^ according to (which is 
Lebesgue measure on M) at positions (ga)i<a<ni, and their joint space-time genealogy lines 
are by construction distributed as a collection of coalescing Brownian motions (ya(s))s<ti, 
evolving backward in time. 

To link (j4.3l) with (j4.7p . we note that in (14.31) . we can regard 

k 

(4.8) 

*=1 3;‘,...,<.eZ 

as a finite signed sampling measure (recall that g has bounded support), which is easily 
seen to converge weakly to the finite signed sampling measure appearing in (14.71) . namely 


(4.9) 


'[l9i{{yl,)i<a<ni)dy\ - ■ ■ dyl,^. 


i=l 


To prove (14.2p . it then suffices to show that (having (j4.8p - (j4.9p in mind): If for each 
1 < a < Uj and I < i < k, Xa^ E Z and ecr^^Xa'^ ^ as e ^ 0, then 

(4.10) 

k k 


E 


[ ((^^ (^U (®a')> (.x];")))l<a<b<ni) ^ E ((r^^® (y* , yl))l<a<b<ni) 


2=1 


2=1 


Step 2 (Invariance principle for coalescents). We next prove (|4.10l) by means of 
an invariance principle for coalescing random walks. 

This invariance principle reads as follows. Given a collection of backward coalescing 
random walks starting at e-dependent positions Xa^ E Z at time I < a < Ui, 

1 < i < k, such that ea~^Xa‘^ ^ y* as e —0, the collection of coalescing random 
walks (xa^(s))s<g- 2 j., rescaled diffusively as {ea~^Xa‘^{€~‘^s))s<ti, converges in distribution 
to the collection of coalescing Brownian motions iyais))s<ti evolving backward in time. 
Furthermore, the times of coalescence between the coalescing random walks, scaled by e^, 
converge in distribution to the times of coalescence between the corresponding Brownian 
motions. The proof of such an invariance principle can be easily adapted from [NR,8051 
Section 5], which considered discrete time random walks with instantaneous coalescence. 
We will omit the details. 

Let (5 > 0 be small. Note that the collection of rescaled coalescing random walks 
(efT“^Xa'^(e“^s)) restricted to the time interval s E [(5,tfc]) together with their times of 
coalescence, converge in joint distribution to the collection of coalescing Brownian mo¬ 
tions {ya{s)) restricted to the time interval s E [<5, tfc], together with their times of co¬ 
alescence. Using Skorohod’s representation theorem (see e.g. |Bil89] l. we can couple 
(ecT“^Xa'^(e“^s))s 6 [ 5 ,tj,] and (ya(^))se[( 5 , 4 ] such that the paths and their times of coales¬ 
cence converge almost surely. Let us assume such a coupling from now on. 
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By the same argument as in the proof of Theorem 11.271 (c), we can rewrite the expecta¬ 
tions in (j4.10p in terms of the backward coalescing random walks Xa^ G Z and coalescing 
Brownian motions y\. Furthermore, we can condition on the coalescing random walks 
Xa'^is) on the time interval and condition on the coalescing Brownian motions 

y*(s) on the time interval [5,tk], coupled as above. 

Given the locations G Z of the remaining coalescing random walks at time 

we now make an approximation and replace them by independent random walks on 
the remaining time interval [0,(5e“^], and make a similar replacement for the coalescing 
Brownian motions. Note that the error we introduce to the two sides of ()4.10p is bounded 
by a constant (determined only by |</>j|oo, 1 < * < times the probability that there is a 
coalescence among the random walks (resp. Brownian motions) in the time interval [0, 

(resp. [0, (i]), which tends to 0 as <5 0 uniformly in e by the properties of Brownian motion 

and the invariance principle. Therefore to prove (I4.10p . it suffices to prove its analogue 
where we make such an approximation for a fixed <5 > 0, replacing coalescing random 
walks (resp. Brownian motions) on the time interval [0, (5e“^] (resp. [0,5]) by independent 
ones. Let us fix such a 5 > 0 from now on. 

By conditioning on the coalescing random walks and the coalescing Brownian motions 
on the macroscopic time interval [5, t^] and using the a.s. coupling between them, we note 
that the analogue of (I4.10p discussed above follows readily if we show: 


Lemma 4.1. //uf,...,u^ G Z satisfy ea —>■ u* as e —>■ 0, then for any bounded 

continuous function ip : M, we have 


(4.11) 


g^ixi, ..., xi)E[iP{{e^rQ^’\^p,{xi),Co{xj)))i<i<j<i)] 

xi 


where g^{x) is the probability mass function of I independent random walks at time 6e~‘^, 
starting atu\,..., uf; while gs{y) is the probability density function of I independent Brow¬ 
nian motions at time 6, starting at ui,... ,ui. 


Proof. If we can replace g'^(xi,..., x/) in (|4.11l) by (ecr ^Ygsiea ^xi,...,ecr ^x;), then 
(|4.1ip follows immediately by applying the polynomial <^h'>P,95 states and 

using the assumption —>■ The only problem is that gs does not have 

bounded support as we require for a polynomial. However, it is continuous and integrable, 
and hence can be approximated by continuous functions with bounded support. Therefore 
the above reasoning is still valid. 

To see why we can replace g^{x) by {ecr~^ygs{ea~^x), note that by the local central 
limit theorem (see e.g. 11176]), 

(4-12) {ea-^)-^gl{\e-^ay'\) ^ gsiy) 

— — 

uniformly in y G [—L, L]^ for any L > 0. Therefore when we restrict the summation in 
()4.1ip to X G [—e“^L, e~^L\\ the replacement induces an error that tends to 0 as e 0. By 
the central limit theorem, the contribution to the sum in (j4.1ip from x ^ [— 
can be made arbitrarily small (uniformly in e) by choosing L large, and hence can be safely 
neglected if we first let e —0 and then let L —>■ oo. I 


4.2. Tightness. In this subsection we prove the tightness of the family of rescaled IFV 
genealogy processes, (5eT^^’^)e>o, regarded as (^([O, oo),U’*)-valued random variables. 
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First we note that it is sufficient to prove the tightness of as random 

variables taking values in the Skorohod space L>([0, oo),U®). Indeed, the tightness of 
£>o in the Skorohod space, together with the convergence of to 

in finite-dimensional distributions, imply that => as li>([0, oo), U®)-valued 

random variables. In particular, {Xp^)t>o admits a version which is a.s. cadlag. Together 
with the fact that Xp^ is a.s. continuous in t > 0, which was established in the proof of 
Theorem 11.271 (b), it follows that Xp^ must be a.s. continuous in t > 0. Note that this 
concludes the proof of Theorem \1.27\ (b). 

Using Skorohod’s representation theorem (see e.g. [Bil89j 1 to couple and 

such that the convergence in D([0, oo), U®) is almost sure, and using the a.s. continuity 
of {Xp^)t>o, we can then easily conclude that —)• a.s. in (^([O, oo), U®), which 

implies the tightness of as C([0, oo), U®)-valued random variables. 


By Jakubowski’s criterion (see e.g. |Daw93[ Theorem 3.6.4]), to show that (5'eA’^^’'^)f;>o 
is a tight family of random variables in the Skorohod space T)([0, oo), U®), it suffices to 
show that the following two conditions are satisfied; 

(Jl) (Compact Containment) For each T > 0 and > 0, there exists a compact set 
Kt,s C U® such that for all e > 0, 


(4.13) 


P(S,Tf_Yf G Kt,5V0 < t < T) > 1 - ,5; 


(J2) (Tightness of Evaluations) For each / G 11^’^, (/(^^T^^f ))i>o, indexed by e > 0, 
is a tight family of i4([0, oo), ]R)-valued random variables. 

Note that 11^’^ (recall from ()1.9p ) separates points in U® by Theorem 11.81 and is closed 
under addition. 

We first prove (J2), following an approach used in [AS 11] . where a family of rescaled 
measure-valued processes induced by the voter model on Z is shown to be tight and con¬ 
verge weakly to the measure-valued CSSM as e —0. We will verify (J2) via Aldous’ 
tightness criterion (see e.g. |Daw93] Theorem 3.6.5]), reducing (J2) to the following con¬ 
ditions: 

PV e 

(Al) For each rational t > 0, {/(S'eA’^_ 2 ) )}e>o is a tight family of M-valued random 
variables. 

(A2) Given T > 0 and any stopping time < T, if | 0 as e | 0, then for each p > 0, 

(4.14) i™h’{|/(s.UT..+4)) - nseNXjl > i) = «■ 


Proof of (J2) via (Al)—(A2). Note that each / G can be written as / = P2i=i 
for finitely many G 11^’^ and Cj G M. It then follows by the triangle inequality that 
to verify (Al)—(A2), it suffices to consider / = G B^’^. We abbreviate for G B^’^, 

4 >| := HS^xJZlP. 

(Al). Note that for <I> = g B^’^, ($j) e>o is uniformly bounded, because (j) is 

bounded, g is bounded with bounded support, and the projection of on the mark 

space M converges to the Lebesgue measure as e j, 0, giving (Al). 

(A2). We will use the duality between interacting Fleming-Viot processes and coalesc¬ 
ing random walks (see Theorem 11.171 and Corollary ll.20p . Recall that <I>| = <h(S'eA’^^Yf). 
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First, we bound 


n\K+s.-K\>v) < = 


(4.15) 


7]^ 

2 


7]^ 


< _E[Var($;„, I X^X)] + I - K} ]. 


where in the second inequality, we added and subtracted | ] from 

and used (a + 6)^ < 2a^ + 2b^. We treat the two terms on the r.h.s. separately. 

First term in (I4.15p . We bound this term by bounding Var(<l>^^^ 5 ^ I ) uniformly 

in ■ First note that is a strong Markov process by Theorem 11.121 Therefore 

^ can be seen as the IFV genealogy process at time with initial 

condition . In particular, it suffices to bound Var(4>^ ) uniformly in the initial 

condition Tq ’ , which we can assume to be deterministic. 

Let $ = and denote x := (xi,..., Xn) € 'IF, y := {yi,..., yn) G Z”. Then by the 

definition of the scaling map in ()1.49p . we have 


Var(4.^J = =E[(<h^j2]_E[‘h^j2 

(4.16) = ^ g{(-cr-^x)g{ea~^y) Cov(0(eVf725" (x)), </>(e^V^ 25 jy))), 


where (x) denotes the distance matrix (^(x,), ^(xj))i<j<j<n of n individuals 
^(xi),... ,i{xn) sampled independently from Xjy 2 s positions xi,... ,x„ respectively. 
In order to evaluate the r.h.s. of (j4.16p we represent the quantity using the duality in 
terms of a collection of coalescing random walks as follows. 

Let {Xy)i<i<n and (X]^*)i<i<n denote a family of rate 1 continuous time random walks 
on Z with transition kernel a as in (I1.15P , and every pair of walks at the same site coalesce 
at rate 7 . The coalescence gives a partition of the set of coalescing random walks at time 
e~^6(,, and independently for each partition element, say at position z G Z, we sample 
an individual from at position 2 :. Let rg^’'^(X ^25 ) denote the distance matrix of 

the collection of sampled individuals associated with the walks j ■ ■ ■) X^Xg at time 

and let rQ^’'^(X^_ 2 ^ ) be defined similarly. We can further construct (X^^*)i<j<„, a 
copy of (X]^’-)i<i<n, which coincides with (X|'*)i<j<„ up to time e~‘^5e on the event 

Gf:- 2 s^{x,y) := {none of (X**)i<j<„ coalesces with any (X^*)i<j<„ before time e“^5e}. 


such that (X^')i<j<„ is independent of (X**)i<j<„. Let rQ^’^(X ^25 ) be the associated 
distance matrix, which is independent of rQ^’'^(X^ 2 ^ )• By the duality relation (see The¬ 
orem imi), we have 
(4.17) 


Cov(<))(e^r"''’^ 


— v-v- = Cov(0(e /'g ~r y- -y 

= E[<^(cVg"^’^(Xf_2,J)<()(6Vg^^’^(Xf_2,J)] - E[<^(cVg^^’^(Xf_2,J)]E[</>(cVg"^'^(Xf_2,J)] 




<2\cj,\lF{G,-2sM^yr)<2\ct>\ 


2 

00 


^{TxuVi ^ ^ 


- 2 ; 
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where Tx^^y^ denotes the time it takes for the two walks and to meet. Note that 
this bound is uniform w.r.t. Substituting it into (j4.16p then gives 

Var($^J < (ecj"^)2^ Y 9{e(r~^x)g{ea-^y)F{Txi,y^ < e-^d,) 

(4.18) = |(/>|^ Y Y1 i^<x~^?^g{x)g{y)F{T^-i^xi,e-^ayj < 

x,y£ea-^I," 

where x := ecr~^x and y := ea~^y. 

We claim that the r.h.s. of (|4.18p tends to 0 as e 0. Indeed, the measure 

{ea-^f'^ Y X] ^xiSy^9{x)g{y) 
l<i,j<n x,y£ea-^Z” 

converges weakly to the finite measure g{x)g{y)dxdy on as e 0. By Donsker’s 
invariance principle and the fact that 5^ —0 as e 0, we note that for any A > 0, 

^{Xe-^axi,e-^ayj < ^ 0 

uniformly in Xi and yj with \xi — yj\ > A. It follows that when restricted to Xi and yj 
with \xi — yj\ > A, the inner sum in (|4.18p tends to 0 as e J, 0. On the other hand, when 
restricted to Xi and yj with \xi — yj\ < A, the inner sum in p4.18p can be bounded from 
above by replacing P(.) with 1, which then converges to the integral of the finite measure 
g{^g{y)d^dy over the subset of with \xi — yj\ < A, and can be made arbitrarily small 
by choosing A > 0 small. 

This proves that Var(<I>^^) tends to 0 uniformly in xY''" as e 0, and hence the first 
term in p4.15p tends to 0 as e 0. 

Second term in (I4.15p . By the strong Markov property of it suffices to bound 

~ 4 * 0 ! uniformly in the (deterministic) initial condition Xq 
Let {X^^)i<i<ni ), and rf^’'^(x) be defined as before p4.17p . Let (W)^‘)i<i<n 

be a collection of independent random walks, such that (N'f*)i<j<„ coincides with {Xp)i<i<n 
up to time on the event 

G^- 2 s^{x) := {no coalescence has taken place among (W**)i<j<„ before time e~^5e}. 
Then we have 


Y 5 (efT"^x)E[())(eVfT 25 " (®))] - Y 


x&lX 




< (eo- ^)’"| Y - Y ME[0(e^Fo'^’"(y))] 


+ (eo- ^)’"| Y (2)} 


xeZ” 


Note that the second term in the bound above tends to 0 as e {, 0 by the same argument 
as the one showing that the bound for Var(4>U in (I4.18D tends to 0 as e | 0. To bound 
the first term in the bound above, we decompose according to the positions of the random 
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walks and rewrite it as follows, where pt{x) denotes the transition probability kernel of 
(4.19) 

^ 5r(ecj"^x)IE[(/)(eVo^’"(Xf_2^J)] - ^ fif(ea"^y)E[(/.(eVo^’"(y))] 

n 

= (^^~^)"| WPe-nSVi-^i)9{<^^~^x)K[(l){e‘^rl^'\y)] - ^ ff(ecj"^y)E[(/.(eVo'^’"(y))] 

x£l/^ i=l yGZ"- 

n 

< l-^loo (eo-"^)’"| Y YlPe-^sAVi - Xi){g{ea~^x) - g{ea~^y)) 
y^l/^ i=l 

n 

|</>|oo ^ YlPe-^sSVi- 

, -1 / NX/ 

+ -^- {x-yy g{ecr ^u{x,y)){x- 


yEZ^ 


rcEZ’^ 2=1 


2=1 


^ C^|^|oo|V 9 \oo{^^ ) ^ ^ J_ J_ Pe~‘^Se (^{ef7“^^Esupp{5')} ^{ea~^y£s\ipp(g)}) ^^{Vi ^i) 

x^yGli'^ 2=1 

— 2nC'|())|oo|V 5|oo(e<T ) l{eo—lj/esupp(g)} 


where C is a constant depending only on n. In the derivation above, we Taylor expanded 
g(ea~^x) around ea~^y when either ea~^x or ea~^y is not in the support of g, Vg and V'^g 
denote the first and second derivatives of g, and u{x, y) is some point on the line segment 
connecting x and y. Lastly, we used the fact that Zzgz ^Pti^) = 0 and Zzgz z'^Ptiz) = ta'^. 
Since g has bounded support, the bound we obtained above is bounded by C'lJg for some 
C depending only on n, (p and g, and hence tends to 0 as e | 0. 

This verifies that the second term in (I4.15P also tends to 0 as e 0, which concludes the 
proof of (A2). ■ 


We have verified (J2) above and hence to conclude the proof of tightness of 
as a family of random variables in the Skorohod space I1([0, oo), U®), it only remains to 
verify the compact containment condition (Jl). Some technical difficulties arise. Because 
the geographical space is unbounded, truncation in space is needed. We also need to control 
how the sizes of different families fluctuate over time, as well as how the population flux 
across the truncation boundaries affect the family sizes. Our strategy is to enlarge the 
mark space by assigning types to different families. Using a weaker convergence result. 
Theorem 11.321 for measure-valued IFV processes with types (but no genealogies), we can 
control the evolution of family sizes as well as their dispersion in space, which can then 
be strengthened to control the genealogical structure of the population. As we will point 
out later. Theorem 11.321 can be proved by adapting what we have done so far, because 
condition (Jl) is trivial in that context. Therefore invoking Theorem 11.321 to prove (Jl) 
for the genealogy processes is justified. 

Proof of (Jl). As noted in Remark ll.4l we can regard U* as a subset of (U*)^, endowed 
with the product M-marked Gromov-weak topology. Since the product of compact sets 
gives a compact set in the product space (U*)^, to prove (Jl), it suffices to show that 
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for each A; G N, the restriction of 


I )t>0,e>0 


to the subset of marks {—k, k) C M, i.e., 


^(fc)^FV,. _ t > 0, e > 0, 

satisfies the compact containment condition (j4.13jl . More precisely, it suffices to show that 
for each T > 0 (which we will assume to be 1 for simplicity), and for each h > 0, there 
exists a compact Kg C U®, such that for all e > 0 sufficiently small, 

(4.20) eKsyO<t<l) >1-6, 


where A; G N will be fixed for the rest of the proof. 

We will construct K^,K‘j,K^ C U®, which satisfy respectively conditions (i)-(iii) in 
Theorem lA.ll for the relative compactness of subsets of U®. We can then take Kg := 
Kj n n K^, which is a compact subset of U®. To prove (j4.20p . it then suffices to prove 
the same inequality but with Kg replaced by for each 1 < i < 3, which we do in 
(1)—(3) below. 

Later when we construct iL| and Kg, we will keep track of the mass of different indi¬ 
viduals having some specified properties. The way to do this is to introduce additional 
marks. We will enlarge the mark space from M to M x [0,1], where [0,1] is the space of the 
additional types, and and then become random variables taking values in the 

space U®^ . For each {X, r, G U®^ , /i(dxdudr) is then a measure on X x M x [0,1]. 

The types of individuals in and will be assigned later as we see ht. 

(1) First let Kg be the subset of U®, such that for each {X,r,fi) G Kg, fi{X x •) is 

supported on [—A;, A:], with total mass bounded by 4A:. Since the family of measures on 
[—A:, A:], with total mass bounded by 4A:, is relatively compact w.r.t. the weak topology. 
Kg satisfies condition (i) in Theorem I A. 11 We further note that a.s., € Kg for 

all A > 0, and hence (j4.2np holds with Kg replaced by Kg. 

(2) For each n G N, we will find below L[n) such that if Kg^ denotes the subset of U® 
with 

(4.21) [[ l^rix,y)>L(n)}fJ-i^xdu)^i{dydv) < - for each {X,r,y) G Kg’"^, 

JJ{XxR)^ 'n 

then uniformly in e > 0 sufficiently small, we will have 

(4.22) F{SX^X^Xt i Kf^ for some 0 < A < l) < ^ =: 

We can then take Kg := CinenKg’"', which clearly satisfies condition (ii) in Theorem lA.il 
while (14.220 implies that (|4.20l] holds with Kg replaced by Kg. 

In order to hnd L{n), we proceed in two steps. First we hnd an analogue of L{n) for 
the limiting CSSM genealogies, and then in a second step, we use the convergence of the 
measure-valued IFV to obtain L[n). 

Fix n G N. To hnd L{n) such that (I4.22p holds, we hrst prove an analogue of (j4.22p 
for the continuum limit by utilizing the types of the individuals. Given t] > 0 and 
7 G [0,1], let 

(4.23) := {(X,r,/x) G : fi{X x [-A;, A:] x [0,7]) < 7}- 

We claim that we can hnd A sufficiently large, such that if all individuals in with 
spatial mark outside [—A, A] are assigned type 0, and all other individuals are assigned 
type 1, then 


(4.24) 


P(A’P G Go,o VO < A < 1 ) > 1 
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In other words, with probability at least 1 — <5^/4, the following event occurs: for all 
0 < t < 1, no individual in with spatial mark in [—k, k] can trace its genealogy back 
to some individual at time 0 with spatial mark outside [—j4,j 4]. This will allow us to 
restrict our attention to descendants of the population in [—A, A] at time 0. 

Indeed, by the construction of the CSSM (Section 11.41) using the Brownian web, the 
measure-valued process which is the measure /rp® projected on the geographic and 
type space, is given by 
(4)25) 

Tj (dudr) = o) 

where f(±A,o)i') the two coalescing Brownian motions in the Brownian web W, starting 
respectively at (±74,0). The event in (|4.24l) occurs if f(±A,o) do not enter [—k,k] before 
time 1, the probability of which can be made arbitrarily close to 1 by choosing A large. 

Let tL > 0 be chosen such that (I4.24h holds. Since Tj)"® = {Xp^, G 

the population in ® with spatial marks in [—^4, A] can be partitioned into a countable 
collection of disjoint balls of radius 4. Let us label these balls by Bn, 2 , • ■ • C in 

decreasing order of their total measure, and all individuals in Bi is given type i G [0,1], 
while all individuals with spatial mark outside [—^4, A] are given type 0. Note that by 
choosing M large, the total measure of individuals in with types in (0, can be 

made arbitrarily small. It then follows by the Feller continuity of the measure-valued 
process stated in Remark 11.291 that as M —)• 00 , the total measure of individuals in 
Tjp®, t G [0,1], with spatial marks in [—k,k\ and types in (0, -p], converge in probability 
to the stochastic process which is identically 0 on the time interval [0,1]. Combined with 
(I4.24p . this implies that we can choose M large, such that 

(4.26) P(T’/"® ^ j_ for some 0 < t < l) < —. 

Let L{n) denote the maximal distance between the balls ..., Bn,M- Note that on the 
event that {Xp^ G j_ for all 0 < t < 1}, for each t G [0,1], the set of individuals 

4fcn ’ M 

in Xp^ with spatial marks in [—k, k] and types in [0, -^] have total measure at most 
while the rest of the population with spatial marks in [—k, k] trace their genealogies back 
at time 0 to Bn,!, ■ ■ ■, Bn,M, and their mutual distance is bounded by L{n) ± 2. Letting 
L(n) = L{n) ± 3 in ()4.2ip then readily implies that {Xp^,rp^, l{|^|<fc}/ip®(dxdu)) G Kp"' 
on the event Xp^ G G^ _L; and (|4.26l) implies that 

4fcn ’ M 

(4.27) P((XP,rP, l{|„|<fc}//P(dxdu)) ^ for some 0 < t < l) < y, 
which is the analogue of (I4.22p for 

Next we turn to and we will deduce (14.221) from (j4.27l) by exploiting Theorem 1 1.32 1 

on the convergence of the measure-valued processes to . Let A be the same 

as above. Let individuals in be assigned types in {0} U {i : i G N} as before (14.261) . 
where L{n) denotes the maximal distance between any pair of individuals in with 
types in The assumption that converges to TJp® in U® as e | 0 allows 

a coupling between SeXp^'^ and such that for most of the individuals in SeXp^'^ 

with spatial marks in [—4,74], their genealogical distances and spatial marks are close 
to their counterparts in We can then assign types to individuals in SeXp^''^ in 

such a way that: individuals with spatial marks outside [—74,74] are assigned type 0 while 
those with spatial marks in [—74,74] are assigned types in {i : i G N}. The distance 
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between any pair of individuals with types in []g,l] is bounded by L{n) + 1, and the 
measure on geographic and type space, converges vaguely to as e 0. By 

Theorem 11.321 {Se^_ 2 ^)o<t<i converges weakly to as random variables in 

C([0,1] , Ad(M X [0,1])). Applying this weak convergence result to (I4.26P then implies that 
for e > 0 sufficiently small, we have the following analogue of ()4.26p : 

(4.28) i G j_ for some 0 < t < l) < (5„,. 

Note that on the event € G j_ for all 0 < t < 1, for each 0 < t < 1, the 

^ * 2fcn ’ M 

set of individuals in with spatial marks in [—k, k] and types in [0, have total 

measure at most while the rest of the population with spatial marks in [—k,k] trace 
their genealogies back to an individual at time 0 with type in and hence their 

pairwise distance is bounded by L{n) = L(n) + 3. It follows that G k'^’^ on the 

event G G^ j_. Together with (I4.28p . this implies (14.2211 . 

(3) Our procedure for constructing is similar to that of K^. For each n G N, we will 
find M = M (n) such that if denotes the subset of U® with the property that for each 
(A, r, G we can find M balls of radius ^ in X, say Bi,..., Bm with B := L)f£^Bi, 

such that fi{X\B x M) < then uniformly in e > 0 sufficiently small, we have 

(4.29) for some 0 < t < l) < ^ =: 

We can then take iF| := , which clearly satisfies condition (iii) in Theorem lA.il 

while (14.291) implies that (14.2011 holds with Ks replaced by K^. 

To find M{n) such that (I4.29P holds, we partition the time interval [0,1] into 
for 1 < i < 2n. It then suffices to show that for each 1 < i < 2n, we can find Mj(n) such 
that if is defined using Mi, then uniformly in e > 0 small, 

(4.30) ^ for some < t <—) < — = . 

Again we first determine M[n) for CSSM and then use the convergence of measure¬ 
valued IFV to measure-valued CSSM. We now prove an analogue of (j4.30p for Since 
VH.® G almost surely, we can condition on its realization and partition the pop- 

2n 

ulation in with spatial marks in [—A, A] into disjoint balls of radius Bi, B2, ■ ■ ■, 

2n 

as we did in the argument leading to (|4.26p . Repeating the same argument there and 
assigning type i to individuals in Bj, we readily obtain the following analogue of (I4.26p : 
we can choose Mi large enough such that 

(4.31) P(A/^® ^ G 1 1 for some ^- < t < —) < —. 

^ ^ V t ^ 2n - - 2n’ “ 4n 

Note that on the event G G±_ j_ for all ^ t for each t G ^], the set 

of individuals in X^^ with spatial marks in [—k, k\ and types in [0, have total measure 

at most while the rest of the individuals in Xp^ with spatial marks in [—k,k] trace 

their genealogies back to an individual at time in i?i U R 2 • • • U BMi , and hence they 

are contained in Mj balls of radius + t Therefore {X^^,r^^, l{|^|<fc}^^®(dxdu)) G 

and the analogue of (jd.dOp holds for 
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To establish (j4.80j) uniformly in small e > 0, we again apply the convergence result in 
Theorem 11.321 Note that by the f.d.d. convergence established in Section 14.11 

converges in distribution to TH.® as e 0. Following the same argument as those leading 

“SIT 

to ()4.28[) . we can then assign types to individuals in such that the associated 

^ “SIT ^ 

measure-valued process, i converges weakly to 

dividuals in with spatial marks in [—A, A] and types in 1] are contained in 

^ “ST * _ 

Mi balls with radius at most Applying this convergence result to (I4.3ip then implies 
that for e > 0 sufficiently small, we have the following analogue of (j4.31l) : 

(4.32) ^ Gi j_ for some —— <t< —) < 

' n'Mi In In' In 

This is then easily seen to imply (|4.30p . 

Combining parts (l)-(3) concludes the proof of (|4.20l) and hence establishes the compact 
containment condition (Jl). ■ 


4.3. Proof of convergence of rescaled IFV processes ^Theorem ll.32p . In |AS11[ 
Theorem 1.1], a convergence result similar to Theorem 11.321 was proved for the voter 
model on Z, where the type space consists of only {0,1}, and a special initial condition 
was considered, where the population to the left of the origin all have type 1, and the rest 
of the population have type 0. The proof consists of two parts: proof of tightness, and 
convergence of hnite-dimensional distribution. 

In [ASll] . the proof of tightness for the voter model does not depend on the initial 
condition, and is based on the verihcation of Jakubowski’s criterion and Aldous’ criterion 
as we have done in Sections 14.11 and 14.21 for the genealogical process. Because the IFV 
process ignores the genealogical distances, the verihcation of the compact containment 
condition (Jl) in Jakubowski’s criterion is trivial, as in the case for the voter model. 
Using the duality between the IFV process and coalescing random walks with delayed 
coalescence, Aldous’ criterion on the tightness of evaluations can be verihed by exactly the 
same calcnlations as that for the voter model in [AS11| . which uses the duality between 
the voter model and coalescing random walks with instantaneous coalescence. Recall 
here that in the rescaling the difference between instantaneous and delayed coalescence 
disappears because of recurrence of the difference walk. Lastly, the convergence of the 
hnite-dimensional distribution for rescaled IFV process follows the same calcnlations as 
in Section im where we can simply enlarge the mark space to M x [0,1] and suppress the 
genealogical distances by choosing = 1. ■ 


5. Martingale Problem for CSSM Genealogy Processes 

In this section, we show that the CSSM genealogy process constructed in Section 11.41 
solves the martingale problem formulated in Theorem 11.401 We will hrst identify the 
generator action on regular test functions evaluated at regular states, and then extend it 
to more general test functions and verify the martingale property. Complications arise 
mainly from the singular nature of the resampling component of the generator, which are 
only well-dehned a priori on regular test functions evaluated at regular states. Fortunately 
by Proposition 11.251 the CSSM genealogy process enters these regular states as soon as 
t > 0, even though the initial state may not be regular. 
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5.1. Generator Action on Regnlar Test Pnnctions. In this section, we identify the 
generator of the CSSM genealogy process acting on $ G and evaluated at 

ACS 

G U®, where Ilr’^ and U® are introduced in Section [1.61 The advantage of working 
with such regular and A^S jg that, the relevant resamplings only occur at the boundary 
points of balls of radius at least 5, which is a locally finite set. In Section 15.21 we will 
extend it to the case G where we need to consider the boundaries of all balls in 
AjCSj which is a locally infinite set. 


Proposition 5.1. [Generator action on regular test functions] 

Let ACS := (aCS)^>q the CSSM genealogy process with A^s g u®, Let G 

defined as in Definition \ 1.361 Let L^s = _|_ ^cs _|_ ^cs defined as in (ll.53p - 

(11.561) . Then we have 


(5.1) 


^.^^ E[$(ACS)]-c|>(AoCS) 
tio t 


LCS$(;t’oCS). 


Furthermore, 


(5.2) 


E[$(aP)] = $(Ao^'^) + / E[L^'^$(A}^^)]ds 


^CS 


CSf 


^CS\ 


where E[LCS<i)(ACS^j continuous in t > 0. 


^CS 


The proof is fairly long and technical and will be broken into parts, with (15.ip and (15.211 
proved respectively in Sections 15.1.11 and 15.1.21 


5.1.1. Proof of (|5.1I) in Provosition I5.il For each t > 0, denote A^s = ,rf^, 

Let L > 0 be chosen such that the support of g{P) is contained in (—L, L)^. Let 5 > 0 be 
determined by cp as in (|1.52p . so that 4>{{rij)i<icj<n) is constant when any coordinate rjj 
varies on the interval rij G [0,(5]. 

We proceed in five steps, first giving a suitable representation of <1>(XCS) and <h(AC®), 
and then calculating actions that lead to the different parts of the generator. 

Step 1 (Representation of <1>(A^®)). We derive here a representation of <h(A^®) 
by partitioning into disjoint balls of radius at least 6/A and utilizing the fact that 
$ G 

Since A(p^ G U®, by Remark [1.351 and ()1.42p . we can identify for each t > 0 with 
(5.3) M.;:= (j{x+,x-}[j{R\Et), 

x£Et 

where x^ and x~ are duplicates of the point x in 
(5-4) Et = UiyoE^, 

where El is the set of points in M that lie at the boundary of two disjoint balls of radius I 
in Xf^, which is consistent with the definition in (11.4611 . 

Denote 

(5.5) {m < 1/2 <■■■ < 1/m} ■■= Elf n [-2L, 2L], 

s 

and let yo, resp. Vm+i, be the point in Eq adjacent to yi, resp. ym- Note that the intervals 
[vt^yf+il il/i^yi+i) U {yf,yfi.i} form disjoint open balls of radius | in A^®. Therefore 
for all Xi G [yf ,y~j^i] and X 2 G [y'j with i / j, df/f := rP(xi,X 2 ) is constant, and 
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do ■= {do)o<i<j<m forms a distance matrix, where we set dg ;= 0 for 0 < i < m. We can 
then write 

(5.6) ci>(AfCS) = ^ ^ ct>{df-) G{y,k), 

fcE{0,-" ,m}’^ 

where y = {yo, ■ ■ ■ ,i/rn+i), is the distance matrix with doij” ~ 1 < * < i < ?!•, 

and 

(5.7) G{y,k)= J---J 9 {x)dx. 

yki<^i<yki+i 

l<i<n 


Step 2 (Representation of <h(X)^®)). We next write <h(Ai/^®), for t > 0, in terms of 
coalescing Brownian motions running forward in time (in contrast to the spatial genealogies 
which run backward in time), which determine the evolution of boundaries between disjoint 
balls in 

Let {yi+Bi}o<i<m+i be independent Brownian motions starting from {yi}o<i<m+i, from 
which we construct a family of coalescing Brownian motions {yi + i?j}o<j<m+i- Namely, 
let yo + Bq := yo + Bq for all time, and let yi + Ri(s) := yi + Bi{s) until the first time 
yi + Bi{s) hits yo + Bo. From that time onward, define yi + Bi to coincide with yo + Bo. In 
the same way, we successively define {yi + Bi} 2 <i<m+i from {y^ + Bi} 2 <i<m+i by adding 
one path at a time. Without loss of generality, we may assume that yi + Bi is the a.s. 
unique path in the Brownian web W starting from (yi , 0). 

To write in terms of the forward coalescing Brownian motions B, we observe 

that 

(5.8) E^^n[yo + Bois),ym+i+Bm+iis)]C [J{yi + Bi{s)} V 0 < s < min |-, f|, 

i=0 

since by our construction of in Section [LTl {yi + Rj(s))o<i<m+i are boundaries of 
disjoint balls in Xp^, and [{yi + Bi{s))^, {yi+i + 7?i+i(s))~], 0 < i < m, consists of either 
empty sets if yi + Bi{s) = yi+i + Bj+i(s), or disjoint open balls of radius | + s in Xf’®. 

Let {meetjt denote the event that either yo + Bo{s) reaches level —L before time t, or 
ym+i + Bm+i{s) reaches level L before time t, or one of the pair {yi + Bi{s),yi^i +i?j+i(s)) 
meet before time t. On the complementary event {meetjf, Bi = Bi for all i. Therefore, 
for all 0 < t < we can write 

(5.9) d>(TP) = l{n,eetT<^>(<^P) + l{meet}?$(<^P) 
with 

(5.10) 5(XP)= M"hG{y + B{t),k), 

fcSfO,... ,m}^ 

where dy^- is the distance matrix with dj^^ = _)_ 2t(l — 5k^^kj) for 1 < i < j < n, and 

B{t) = {Bo{t), • • • , B^+i{t)). 

Since 

(5.11) mxp^) - $(TP)| = l{„,eet}J‘b(TP) - 5(TP)| < C$l{^eetT 

for some G^ depending on <I>, and the probability of the event {meet}* decays exponentially 
in by elementary estimates for Brownian motions, we have limi^o — )] ^ _ 

0. Thus, we may replace ^{Xp^) by ^{Xp^) up to error o{t) as t —)• 0. 
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By (j5.6j) and (j5.10j) . we can write 
(5.12) 

= ^{cPidf-) - 4>{df^)) G{y + m)A) + Y.^{df-){G{y + B{t)A) - Giy,k))- 


Step 3 (Aging). We first identify the aging term in the generator, defined in 
()1.55l) . For each term in the hrst sum in (I5.12p . by Taylor expanding 0(dy^-) in t, it is 
easy to see that 


(5.13) limt ^E[((^(4''-) - (^(4""-)) G(y+ B(t),fc)] = 2 ^ G{y,k), 


l< 2 <j<n 


dcj) 

ij 


dr. 


where we note that = 0 if /cj = kj, since in this case the ij-th argument of 

k'k' ^ 

(j), d^ ^ , is less than 5. Summing the above limit over k G {0, • • • , m}” and using the 
definition of G, we find 

1 




(5.14) 


fce{0,--- ,m}” 


2 [ g{x) Y. 


where r ;= (r^®(xj, Xj))i<jj<n. This gives the aging term . 

Step 4 (Resampling). We next identify the resampling term defined in (|1.56l) . 
For the second sum in (|5.12l) . we need to compute 


7 X] + B(t),fc) - G(y,fe)] 


(5.15) 


fce{ 0 ,-" ,m}" 


= Y f^^^[G{y +- G{y,k)]. 

fcS{ 0 ,--- ,m}" 


For each G {0, • • • , m}"', 
(5.16) 


G{y + mt)A)-G{y,k) 



+-®fci (i) iJ/fci+1 _|_i 


(t)]( 2 :i) 




2=1 


dx. 


We can rewrite the difference of the product of indicators as 
(5.17) 

n n 

n ~ '^[yki,yki+Bki(t)]i^i) + Myk^+^,yk^+l+Bk^+^{t)]i^i)) ~ (®*)- 

i=l i=l 

We can expand the first product above and sort the result into three groups of terms, 
(Gi), (G 2 ) and (G 3 ), depending on whether each term contains one, two, or more factors 
of the form l[yi,j/i+Si(t)] for some 0 < z < m + 1. If h{x) denotes a term in (G 3 ), then 
necessarily J-^n g{x)h{x) < Gg\Bij^{t)Bi^(t)Bi^(t)\ for some Gg depending only on g and 

3 

some zi,Z 2,*3 G {0, • • • ,m+ 1}. Since 'E\Bij^{t)Bi^{t)Bi^{t)\ < terms in (G 3 ) do not 
contribute to the limit in (I5.15p . 
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Each term in (G 2 ) is of the following form, where 1 < * 7 ^ J < n, 

^ Ivki ,yki +-Bfe . (t)] (3^1) 1 [ykj ,ykj +Bkj (t)] ) ri ^ Ivkr -J/fcr+l] ^ ’ 

l<T<n 

'^[yki+i,yki+i+Bk^+i{t)] i^i)Mykj+i,ykj+i+Bkj+i{t)]i^j) \yk.r,yk^+i]^^T)^ 

l<T<n 

~ ^ [vki+i ,yki+i+Bki+i (4)] ) 1 [vkj ,ykj +Bkj (*)] (^i) 11 ^ Ivkr -i/fer+il ) ’ 

l<T<n 

l<T<n 

t 7» j' 


(5.18) 


Denote the four terms in ()5.18l) respectively by h\^\x), h\^\x), h®(x) and h\j\x). Then 
(5.19) 



••• ,yki,--- .Vkp--- ,X„) + o(l))dx, 


where we replaced g'(x) by g{xi, ■ ■ ■ ,yki, - ■ ■ ,ykjr " : ^n), with an error of o(l) as t | 0 . 
Therefore 
(5.20) 

lim -E 
t —^0 t 

yk^<^T<yk^+i 

T^i,j 


hfj{x)g{x)(ix 


= 4,:, 


9{xi,- 


I Vki 


) yk-j 


11 dx^ 


We obtain similar results for h^‘^\ and ■ 

For a fixed pair i j, when we sum over k and all contributions from h\j in (j5.15p . we 
obtain an integral for 4’{r)g{x), where Xr, t ^ i,j, are still integrated over M"", however 
the integration for Xj and Xj are replaced by summation over {ya}i<o-<m- Contributions 
only come from x* = Xj, and is positive when ki = kj, and negative when kt = kj + 1 or 
kj = ki + 1. Writing everything in terms of Tfp®, we easily verify that the contribution of 
terms in (G 2 ) to the limit in (|5.15l) is exactly 

(5.21) ^ L ^ 'i-{xi=xj}gix)iOij(p - 4))iz) n dx,-, 

* Xi,XjG{yt,yaX<a<m} 

where Bijcj) is defined as in (jl.l9p . and r := (xq ^(xj, Xj))i<ij<n. This gives the resampling 
term defined in (11.561) . 

Step 5 (Migration). Lastly we identify the diffusion (migration) term defined in 
(|1.54p . We note that each term in group (Gi) is of the following form, where 1 < j < n. 


'^[ykj,ykj+Bkj{t)]i^j) H 

l<i<n 

i¥=i 

\ykj+i,ykj+i+Bkj+\{t)]{xj) H 

l<i<n 


(5.22) 
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Denote the two terms respectively by (x) and (x). Then 

f h^l\x)g{x)dx 


Vkj +Bkj (t) 


(5.23)= -/•••/ J (^9{xi.^y^) + ixj-ykj)-^^{xl.^y^) + o{\xj-yk^^^^ 


yki<^i<yki+i xj=yk 

Therefore, 

(5.24) 


lim -E 
t—^0 i 


[ hf^\x)g{x)d;i 

JR" 


dg 


[X 


) dxj. 


! / J dxj '^j—Vkj 

•G , J l<i<n 

Vki<^i<Vki + l 


( 2 ) 

A similar result holds for h) . Combining the two, we see that 


(5.25) 


lim -E 
t—^0 t 


[ {hf\x) + hf\x))g{x)dx j ■ ■ ■ j ^^{x) JJ dx*. 

■ J J J j i<i<n 


Vki ‘^‘^i'^yki + 1 


Therefore, the contribution from terms in (Gi) to the limit in (|5.15p gives precisely the 
migration term defined in (|1.54p . This establishes the generator formula (15.11) . ■ 


5.1.2. Proof of (j5.2p in Proposition 15.11 The complications in proving (15.2p arise from 
trying to prove uniform integrability for various quantities. We proceed in three steps. 
First we show that, for each t > 0, 

E[<h(TCSj-E[<h(ACS)] 


(5.26) 


lim ■ 


h 




By the Markov property of {Xp^)t>o, 


(5.27) 


lim ■ 

h^O 


IE[<l>(T^CSj_E[ci,(A^P)] 


= limE 

h 4-0 


E[ch(ACSj|AiP]-cI>(AP) 


Since E U® a.s. by Proposition 11.251 limfe|o ^ ^ al¬ 

most surely. Therefore, the first step is to interchange limit and expectation in (15.271) and 
to deduce (j5.26p . We need to show that 
(5.28) 




- 


is uniformly integrable for h > 0 small, say 0 < h < -. 

h>0 2 

Once the uniform integrability has been verified, in Step 2 we prove that E[L*^®<1>(A’/^®)] 
is continuous in t, and then in Step 3 we put things together and prove (|5.2p . 


Step 1 (Uniform integrability). This step constitutes the bulk of the proof of (15.21) . 
Let us fix and examine the error terms in our earlier calculation of the generator 
formula (|5.ip with replaced by Instead of partitioning Ap® into disjoint open 

balls of radius <5/4 as done in (15.5p . we set 

SAt 

(5.29) {yi < ••• :=E/ n[-2L,2L], 

which is determined by the dual Brownian web W as seen from the definition in (|1.46p . 
This choice of partitioning of removes the dependence of y = (yi,... ,ymt) the 
initial condition A,^®. Note that mt depends on 
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Let Et[-] denote the conditional expectation Following the arguments leading 

to (|5.1ip . 


(5.30) 


h-^ 
= h-^ 






where {meet}/i is the event that either uq + Bo{s) hits level —L before time h, or Hmt+i + 
i?mt+i('S) hits level L before time h, or one of the pair {yi +Bi{s),yi+i +Bi+i{s)) coalesces 
before time h. We now estimate the two terms in (|5.3UI) . 


(i) We start with the second term in (15.301) . Based on the decomposition (|5.12l) . 

~ = H{h,^{h)) for some function H{h,zo, - ■ ■ ,Zmt+i) which is con¬ 

tinuously differentiable in h and three times continuously differentiable in zq, ..., Zmt+i 
with uniformly bounded derivatives. Since the generator formula (15.11) is derived by Taylor 
expanding Ff(/i, it is not hard to see that uniformly in /i G (0, ^), 


(5.31) 

Ei[$(TCSj_c^(^P)]_/,LCSc^(TP) 


|Ei[iL(/i,B(/i))]-/iLCS$(A'P)| < Cg^^{mt + Z)h 


for some depending on g and (j), and rtit+'i is the number of variables va. H{h, zq, ..., Zmt+i 
Therefore 


(5.32) h-^\EtmX,%l) - <1>(tP)]| < Cg,4mt + 3) + 

uniformly for /i € (0, ^). From the definition of we note that 

(5.33) |lCS$(tCS)| < Cg^^(i + mt) < oo. 

By the definition of mt in (I5.29P and by Lemma IB.51 we have E[mt] < < oo. This 

implies the uniform integrability of the second term in (j5.30p for h G (0, ^). 

(ii) We now consider the first term in (j5.30p . For 0 < f < mt, let be the first 

hitting time between yt + Bi(s) and i/j+i -|- Bi^i{s). Let tq be the first hitting time of level 
—L by j/o + Bq{s), and Tmt+i the first hitting time of level L by y-mt+i + Bmt+i{s). Then 

(5.34) 

h-^Et [l{^eet}, I ‘h(^t?J - ‘h(<^i^i) I] 

mt 

< 2|4'|oo/i“^Et -f h~^Et 1|t-. .^^<fej|4>(T’fffe)-$(T)^fe)| . 

i=0 

Since yo < ~ 2 L and y-mt+i > 2L, the probability of the events {tq < h} and {rmt-i-i < h} 
decay exponentially fast in h~^ and the events are independent of Therefore the 

first term in (I5.34p is uniformly bounded in /i > 0. 

Bounding the second term in (|5.34l) is more delicate, because it remains of order 1 
as /i i 0. We will need to use negative correlation inequalities for coalescing Brownian 
motions established in Appendix O 

Let us recall the definition of 4>(T:)^®^) from (|5.10p . where we replace the pair {0,1} by 
{1,1 -|- h}. By integrating over the population at time t + h, we note that both $(A)^®^) 
and 4>(A}*(^^) can be written as integrals of g{x)4>{L{x)) integrated over x = (xi, • • • , Xn) G 
(—L, L)", except that: for a given x, the distance matrix r{x) may be different for 4> and 
4>, and for 4>, the same point x may be integrated over several times with different distance 
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matrix r(x) due to the fact that {yi + i?j(/i)}o<j<mt+i may not have the same order as 
{yi}o<i<m+i- However, for x not in 

(5.35) 1) := |x € M”' : Xj G {yj + Bj{s)} for some 1 < i < n and 0 < j < m* + l|, 

se[o,/i] 


X is integrated over exactly once in and the associated distance matrix r(x) is the same 
for both and <h. Therefore, contributions from x^D cancel out in 

Since g has compact support, the contribution from x^Dto includ¬ 

ing multiple integrations over the same x by $, is at most C(^,gu|g|oo| 0 loo 
where Ri{h) := (supo<s</i Hj(s) - info<s</i Hj(s)). Therefore, 


h-^E, 


rrit 


i=0 


(5.36) 


mt mt+1 

i=0 j=0 

mt mt+1 

< Cn,tit,gh-^ Y. Y. IPs(ri,i+i < h)mB[R,{hf]^ 

2=0 j = 0 

mt 

< CY,gh~Hmt + 2)Y^BiTi,i+l < h)5, 

i=0 


where Eb denotes expectation w.r.t. the Brownian motions H = {Bq, ..., Bmt+i), and we 
applied Holder inequality and the fact that E[Rj{h)‘^]^ = C'/h for some c > 0. It only 
remains to prove the uniform integrability of the r.h.s. of (I5.36P w.r.t. the law of for 
0 < h < 

Note that the r.h.s. of (j5.36p depends on yo and i/mt+ij which lie outside [—2L,2L]. To 
control the dependence on yo and ymt+i^ we enlarge the interval and let {zi, • • • , zm+i} ■= 

SAt 

n (—2L — 1, 2L -|- 1) as in (I5.29p . which contains {yi, • • • , ymt} as a subset. Denote 




Vi+i - Vi 

Vh 


) := P(rj,j+i < h)2 


•i 


Vj + l-Vj 
G 2 F 


e 



2 


we can then replace the r.h.s. of (|5.36p by 


(5.37) 

because C'^ ^ gFh{Xp^) dominates the r.h.s. of (I5.36P except for possible missing terms 
;^V^( ^’^° )(mt + 2), resp. ) (ruf + 2), on the event y^ < —2L — 1, resp. 

Umt+i > 2L + 1. Since yi > —2L and ymt < 2L by dehnition, both missing terms are 
bounded by Cimt + C 2 uniformly for h > 0, and is thus uniformly integrable. 

It only remains to prove the uniform integrability of Fh{Xp^) uniformly in 0 < /i < 

We achieve this by bounding its second moment. Note that by Cauchy-Schwarz, 


E[Fh{X, 








M 




Zj+I - Zj 

'/h 

i=l ^ 


i=l 

M 


< -E 
h 


M+l M 

Y Y^i 

^1?^25^3 = 1 M = 1 


■^24 + 1 ^24 \ 2 


Vh 


(5.38) 


< 
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Note that the above summation can be seen as a summation of xi := Zjj, X 2 := := Zi^ 

in {zi,..., zm+i} and X 4 := Zi^ in {zi,... ,zm}- By integrating over xi, X 2 , 0:4 over 
[— 2 L — 1 , 2 L + 1 ] and the position X 5 := Zi^j^i G (x 4 , 2 L + 1 ), we can rewrite the r.h.s. 
of in terms of the correlation functions of the translation invariant simple point 

5At 

process ^ := C M. More precisely, it can be written as 

X 5 — X 4 

Vh 

x^<x^ 

klMa:2l,n3|,|a;4|,|i5|<2L + l 

where 


5(Xi, X2, X3, X4, X5)dXidX2dX3dX4dX5, 


(5.39) i^( 


(5.40) 5 (xi, • • • , X5) := lime ®P([xj, Xj + e]n^/0Vl<j<5, (X4, X5) n .^ = 0) 


is the density of finding a point at Xi for each 1 < i < 5, with no point in (x 4 ,X 5 ). By 
the definition of El in (11.4611 and the duality between the forward and dual Brownian 
web (>V,W), we see that ^ is the point process generated on M at time t by coalescing 
Brownian motions in the Brownian web W starting from every point in M at time t — 

By the negative correlation inequality in Lemma IC.31 we can bound 

iL4^5(xi, • • • ,X5) 

< lime“^P([xj, Xj + e] PI / 0, j = 1, 2,3)P([xj, Xj + e] n ^ / 0, j = 4, 5 ; (X4, X5) n ^ = 0) 
= iL(0)3iL^(x4,X5), 


where by Lemma IB.51 


and 


iL(0) = lime ^P([x, x + e] n | / 0) 


2 

\/t< 5 A t 


for all X G M, 


K^{x, y) = lim e ^P([x, x + e] fl | 7 ^ 0, (x + e, y) n ^ = 0, [y, y + e] fl ^ 7 ^ 0) for x < y. 
By Lemma fC.61 for x < y, 

K‘'{x,y) < Cs,t{y - x) A 1 

for some Cs,t > 0 depending only on 5 and t. Substituting the above bounds into (15.391) . 
using the definition of " 0 , and separating the integration into two regions depending on 
whether 0 < X 5 — X 4 < y/h or X 5 — X 4 > y/h, it is easily seen that the integral in (j5.39p 
is uniformly bounded for 0 < h < This implies the uniform integrability of Efi{Xp^) 

for 0 < /i < ^, and hence that of _ 


Step 2 (Continuity of E[L^®4>(T’/^®)]). Recall from (|1.53p - (ll.56n that + 

We will prove the continuity for each component of the generator. 

By our assumptions on g and (j), |Lj®$|oo, iLa^'hloo < < 00. It was shown in (13.4 p 

that for any t > 0, for Lebesgue a.e. x = (xi, • • • , x^) G M”, the distance matrix rp®(xj, Xj) 
converges a.s. to r^^{xi,Xj) as s ^ t, and this conclusion is also easily seen to hold for 
t = 0 by the assumption T’jp® G U®. Therefore, almost surely w.r.t. (T’f^®)<j>o, 

(5.41) hmLf ch(T^“) ^ lPcI>(aiCS) and 

s^t s^t 

Therefore E[L^^<^>{Xp^)] and E[lP 4>(T’P)] are continuous in t > 0 by the bounded 
convergence theorem. 

We now turn to the continuity of E[Lp®<l>(T’/^®)]. We first prove the continuity at t = 0 
and later point out how it extends to t > 0. Let Eq C M be defined from as in (11.461) . 
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which are the boundaries of balls of radius | in Let us follow paths in the Brownian 

s 

web W starting from points in Eq C M at time 0, which determine the evolution of these 

s 

boundaries, and denote the point set on M generated at time s > 0 by -Eg^(s). Then 

Eg C Eq{s). By our regularity assumption on (j), only resampling at the boundaries of 
balls of radius S or more has an effect on Therefore for s E (0, (5/4), we have 

(5.42) 

^ dxr, 

i<i^j<n x^,xMyEy-.yeE^,^%)} 

where Oijcj) is defined as in (11.191) . and r := {r^^{xi,Xj))i<ij<n- By our assumptions on g 
and (/, the fact E U®, and our construction of in terms of the (dual) Brownian 
web, it is then easily seen by dominated convergence that 

(5.43) limLr<h(T’/"®) = Li.<h(T’(p®) almost surely. 


To prove the continuity of E[L‘^^‘h(d:’/^®)] at t = 0, it only remains to verify the uniform 
integrability of Lp®4>(T’f^®) for s close to 0, say s E [0,(5/4]. We will achieve this by 
showing that has uniformly bounded second moments. 

Note that because g is assumed to be supported on [—L,L]”, we have 


(5.44) 


LP4.(A^fS)| < n (-L,L)|. 


By Lemma IC.21 Eq^‘^{s) is negatively correlated, and hence by Lemma IC.51 we have 


(5.45) e[|.bJ/^(s) n (-L,L)|^ 


< 2E 


|sJ/^s)n(-L,L)|j + [E|i?g'/'(s)n(-L,L)|j'. 


Thus it suffices to bound E[|Sq^^(s) n (—L,L)|] uniformly in s E [0,5/4]. 

Since £(q^^(s) is obtained by evolving coalescing Brownian motions starting from E^^^, 
we can bound 

(5.46) E[|L;o'/"(s)n(-L,L)|] < n [-2L, 2L] | + 2 E[|eF+']xW n (-L, L)|], 

i>2L 


where c M is the point set generated at time s by coalescing Brownian motions 

in the Brownian web W starting from everywhere in [i, f + 1] at time 0. 

Note that the first term in (|5.46p is finite and independent of s > 0. We now treat the 
second term. For each i > 2L, let us denote n {—L, L) by which is also a 

point process on (—L, L) satisfying negative correlation. In particular, 


CXD OO CX3 ^ « 

(5.47) E[\C,l\] = Y^(Kl\ >n)< > If < E / 

—1 —1 —1 ^v2vrs Ji-i 

n=l n=l n=l '' 


2 s dx 


where we used Lemma IC.41 and the observation that, Cg l ^ ^ implies that the Brownian 
motion starting at (i, 0) in the Brownian web must be to the left of L at time s. Since 

L is fixed and large, I^l ^~^dx < a for some a E (0,1) uniformly in i > 2L and 
s E [0,5/4], and hence 


mi,L\] < 


TOO 

Ji-1 


' 2 s dx 


(1 — a)\/27rs 


/ OO —2L- 1 

i-L e 2 dx 

vs 


(1 — a)\fTn 


(5.48) 
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It is then clear that X]i> 2 L lI] tends to 0 as s | 0 and is uniformly bounded for 
s G [0, (5/4], which concludes the proof of the uniform integrability of for s G 

[0, (5/4], Therefore is continuous at t = 0. 

The continuity of at t > 0 can be proved similarly. Tors G 

we can use the same representation as in (I5.42p by treating t — ^ as the starting time 0 
in (|5.42|) . The a.s. convergence lim^^t = Li.4>(d/p®) then follows as before. The 

uniform integrability of follows easily from the density estimate, Lemma lB.51 

if we restrict to s G [t — + ^], since only resampling at locations occupied at time 

s by Brownian web paths starting from M at time t — ^ has an effect on 

Step 3 (Proof of ()5.2|l L We have thus far shown that E[<h(T’/^®)] has a continuous right 
derivative on [0, oo). Note that E[$(T’/^®)] is also continuous on [0, oo) 

since has a.s. continuous sample paths by Theorem 11.271 Equation (15.21) then 

follows from a variant of the fundamental theorem of calculus, formulated as Lemma 15.21 
below. This finally completes the proof of (|5.2I) and that of Prop. 15.11 ■ 

Lemma 5.2. [Fundamental Theorem of Calculus] 

Let / : M —)• M 6e continuous, with a continuous right derivative D~^f{x) := lim/ij^o 
h) — f{x)). Then 

f{x) = /(O) + [ D+f{y)dy for all x G M. 
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Proof. Let g{x) := /(O) + D~^f(y)dy, which is clearly differentiable with continuous 
right derivative D^f. Then h := f — g is continuous with h{0) = 0 and right derivative 
D'^h = 0. It suffices to show that h = 0. 

For e > 0, let /ie(x) = h{x) + ex. Then D~^he = e > 0. This implies that h^ is 
non-decreasing on R, since otherwise if /ie(x) > he{y) for some x < y, then at the point 
xq G [x,y] where achieves its maximum on [x,y], we have D^h^{xo) < 0, contradicting 
D'^he = e. Letting e 0, the monotonicity of h^ then implies that h is also non-decreasing. 
Applying the same argument to —h shows that h is also non-increasing. Thus h = 0. ■ 

5.2. Generator Action on General Test Functions. In this section, we extend the 
validity of the integral equation (|5.2p in Prop.jSTjto general G 11^’^, assuming G U®. 
The complication is that the generator identified in m, which acts on regular test 
functions <I> G 11/’^ and evaluated at regular states G U®, can only be extended 

to general test functions $ G 11^’^ provided that we restrict to the regular subclass of 
states G U® introduced in Definition 11.391 Fortunately for each t > 0, almost surely 
ddp^ G U®, which makes it possible to extend from 4> G IlJ’^ to G 11^’^. 

Proposition 5.3. [Generator action on general test functions] 

Let be the CSSM genealogy process with G U®, and let $ = g II^’^. Then 

(a) The integral equation ()5.2p still holds, and E[L'"®4>(A/‘^^)] is continuous in t > 0. 

(b) If Pdp^ G U®, defined as in Definition \ 1.391 then the generator equation (15.ip still 
holds, and E[L'^®<I>(A’/^®)] is continuous in t > 0. 

Proof. We will approximate g II^’^ by 4>; G II/’^ as follows. Let p : [0, oo) — R 

be continuously differentiable with p{x) = 0 for x G [0,2], p'{x) > 0 for x G (2,3], and 
p{x) = X for X G [3,oo). For each I > 0, denote ppx) = lp{x/l). Then pi is a smooth 
truncation with supj,>Q \pi{x) — x| ^ 0 and p'i{x) 1 for each x > 0 as Z | 0. Given 
(p = 4>{{rij)i<i<:j<n), let cpi := p o pi. We then define the truncated version of by 

<|)j := 
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It is easily seen that <!>; E nj’^, and hence Proposition 15.11 can be applied. We will then 
deduce Proposition 15.31 bv taking the limit 11 0. 

(a) We will take the limit / 0 the integral equation (15.21) . In Step 1, we will show 

that (j5.2p also holds for d>. In Step 2, we will prove the continuity of 
in t > 0. Note that without assuming E Ilr’^ or dijp® E U®, may not be 

well-defined. 


Step 1 First note that — <I>|oo ^ 0 as / 0 by our assumption on g and 4> and the fact 

that sup 3 ,>g \pi{x) — x| ^ 0 as / I 0. Therefore, 

(5.49) hmE[ci>;(TP)] = E[ci>(TP)] and 

Recall the decomposition of -|- Lp® in Prop. ITTI Note that |Lj®<h|oo, 

iL^oo, iL^^^oo are all uniformly bounded by some (7$ < oo independent 

of 1. Also, by our assumptions on g, (p and pi, for each s > 0 and a.s. every realization of 
Tps , we have 

(5.50) ®) ^ and LP$;(A’f S) ^ LP4>(Tf ®) as U 0. 


Therefore, by the bounded convergence theorem, 


(5.51) 


hm/ E[(L)r^ci>, + L^^4.,)(T,^^)]ds= / + L^^^){X^^)]ds. 

4U JQ JQ 


For the resampling generator note that for any s > 0 and a.s. every realization of 
vCS 

‘^S ! 

(5.52) 


|LP<h(TfS)|< [ 


^{xi=xj}\9ix)\\ieij(p-^)iL^^)\ JJ dx. 




Xi,Xj^{x'^^x~:x^Es\ 


l<T<n 


— Cn,g,(j) ^ ^ dx A 1, 
xe{-M,M)nEs 


where Eg is defined as in (15.4h . which is the subset of M that are points of multiplicity in the 
dual Brwonian web W, dx = r^®(x^,x“) is the distance between the two points in Xp® 
with the same spatial location as x E Eg, M > 0 is chosen such that supp((j() C (— M, M)"', 
and we used the assumption that (p has bounded derivative. Such a bound holds for (pi 
uniformly in Z > 0. If Y2x&{-M M)nEs A 1 < oo, then by dominated convergence, we 
have 

(5.53) lnnLP$;(Tf^) = LP$(Tf®). 

To prove the analogue of (I5.5ip for Lp®<I>, by dominated convergence, it only remains to 
show 



4 A 1 

xe{-M,M)nEs 


(5.54) 


ds < oo. 
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Note that for each s E (0, t), 
(5.55) 


E 


dx A1 

xe{-M,M)nEs 


= E 




l{d2;Al>'u}d'U 


= / E 


xG{-M,M)nEs 
2sAl 


{x E (—M, M) n Eg : dx > u} 


dtt 


< 


E 


{x E (—M, M) D Eg : dx > u} 


d?x + 


E 


{x E (—M, M) n Eg : dx > 2s} 


dtt 


|•2sAl 

/ E 

Jo 


2sAl 


^Rx{s-«/2} p 


"I 


du + 


E 


/2sAl 


^Rx{0} p 


du 


= CMV27aT + CM -—< CM(l + 

^/s 

where denotes the point set on M generated at time s by the collection of paths in the 
Brownian web W starting from the space time set A C and we used n 

[a, 6]| ] = by Lemma IB.51 Inequality (15.541) then follows. To summarize, we have thus 

shown that (15.2p is also valid for a general polynomial E B^’^. 

Step 2 To prove the continuity of E[L'^^<h(T’/^®)] in t > 0, we again decompose into 
its three summands. First note that the continuity of E[L^®<I>(d4^®)] and E[L^®<I>(T}^®)] 
on [0, oo) follow by the same arguments as that for E[L^®4>(d4‘^^)] and E[L^®<h(d4‘^®)] in 
the proof of Prop. 15.11 

To prove the continuity of E[Lp®<I>(T’/^®)] in t > 0, we fix t > 0 and a truncation param¬ 
eter e E (0,t). For each s E (t — e,t -|- e), we decompose into 

and where both and are defined as in 

(I5.42|) . except that resampling therein is carried out by summing over 

for 

(o.oo) 

l{x^=x,} for L<^+^-*cI>(TfS), 

Xi,Xje{y+,y-:yeE^\El+‘^-A 
where Eg and Ef are defined as in (|5.4p . 

The same argument as in the proof of the continuity of E[L}"®<I>(T}^®)] in Prop. 15.11 
shows that 

(5.57) limE[|4^"+"-‘$(Tf3) - 4^^+"-*4 '(tP)|] = 0. 


s — 


On the other hand, for s E [f — |,f -|- |], by the same calculations as those leading to 
P5.551) . we have 


E[|L 


<e+s—t 


CS\ 


(5.58) 


— Cn,g,(p^ 




E 


< Cn,g,^E 


dr A 1 


dr A 1 


E 

x&(-M,M)riEs\El+‘-'^ 

< CMVe + s-t < C'MyfE 


x^{-M,M)nEs 

dx <2(e + s — t) 


Since e > 0 can be chosen arbitrarily small, p5.57p and (15.581) together imply that 


limE[L;t^4>(T}"^)] = E[L}^^4>(T^^^)] when t > 0. 

s—>-t 
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(b) We now verify the continuity of (and hence of at 

f = 0 under the additional assumption that G U®. Together with (15.2p . this also 
implies that the generator equation (|5.1D holds for a general polynomial G 11^’^. 

As before, we separate into and where the 

truncation parameter e > 0 is fixed and small. Equation ()5.57p also holds with f = 0 
by the same argument as that for the continuity of E[Lp®<l>(A’/^®)] at f = 0, when $ G 
rir’ . Indeed, in both cases, only resampling between individuals with sufficiently large 
genealogical distance contribute to the generator action on 
It remains to show that 

(5.59) sup E[|L<^+^cI>(A’fS)|] ^ Q. 

0<s<e/2 


For 0<s<e/2, we can separate the resampling terms according to whether the 
genealogies of the two resampled individuals merge above or below time 0 , and write 


E[|L<"+"^>(Af^) 


< 


C*n,g,(/jE 


E 


xG{-M,M)nEs 

^ 2 (e + s) 




xe(-M,M)nEs 
^x ^ C0'2s] 


dx ~\~ 


E 


x^{-M,M)nEs 

^x ^(2s,2(e+s)) 


(5.60) 


< 


Cn,g,^M^/s + 


x£( — M,M)nEs 

dx€{2s,2(e-\-s)) 


It remains to bound the expectation on the r.h.s. above, which originate from resampling 
between individuals whose genealogical distance depend on the distance of their ancestors 
in at time 0. Let .^g i^j^g point set on M generated by the collection of 

paths in the dual Brownian web W starting from the space-time set [—M,M] x {s}. Let 
us order the points in ^g denote them by zi < 2:2 • • • < zMs+i, with a total 

of Mg + 1 points. Then by our construction of Ap®, we have 


(5.61) 


E 


'Y 

xG{ — M,M)nEs 

d-x ^(2s,2(e+s)) 


IVls 

= «[E 




M, 

< CMyTs + E [ X] + 1 X 24^0 ^i+l) 

i=l 


CS/ 


where we used EpM^] = because by duality between the Brownian web and its dual, 
Mg is exactly the number of points in the interval [—M, M] occupied at time s by Brownian 
web paths starting from R at time 0, and hence Lemma lB.5l can be applied. For each pair 
{zi,Zi+i) with r^®( 2 i, Zj+i) < 2e, by the properties of A’fp®, there must exist a Ui ^ Eq 
with Zi < Ui < Zi+i such that rQ^{zi, Zi+i) = dy^. Note that the yi, 1 < i < Mg, are all 
distinct with dy^ < 2e. We can now separate the contribution to the second term in (j5.6ip 
into two groups. 

The first group consists of contributions from pairs {zi,Zi^i) with —2M < Zi < Zj+i < 
2M. The total contribution from these terms is uniformly dominated by Yl xe[- 2 M, 2 M]nEo dx, 

dx ^2e 

which tends to 0 as e 0 by the assumption G U®. 

The second group consists of contributions from pairs {zi,Zi+i) with either Zi < —2M 
or Zj+i > 2M. We bound these terms by 2e times the expected cardinality of such pairs. 
By the duality between W and W, the cardinality of such pairs of (^j, Zj+i) is bounded by 
the cardinality of the point set on (—M, M) generated at 

time s by paths in W starting from the space-time set{x G R : |x| > 2M} x {0}. As shown 


















54 


ANDREAS GREVENi, RONGFENG SUN 2, ANITA WINTER ^ 


in (j5.47p . the expected cardinality of is uniformly bounded 

in s G [0, Z] for any I > 0. Therefore, uniformly for s € [0,e/2], the second term in (j5.61l) 
is also bounded by a function of e, which tends to 0 as e J, 0. This concludes the proof of 
(I5.59|) and Proposition 15.31 I 


5.3. Proof of Theorem 11.401 We now prove Theorem II. 4UI Note that the various path 
properties listed in Theorem 11.401 has been verified for the CSSM genealogy process 
in Prop. 11.251 and Theorem 11.271 except for the claim that for each t > 0, G U* 
almost surely. To verify this last claim, we need to show that a.s., 

(5.62) ^ ,x~) < oo for each M > 0. 

xGEtr\(-M,M) 

where is defined as in (15.4|) . Since for each e > 0, the set {x G Et Ci (—M, M) : 
r^^{x~^,x~) > e} is a finite set because G U®, (|5.62p is reduced to showing that a.s.. 


) < OO 

xeEtn{-M,M) 


for each M > 0. 


The l.h.s. above has been shown in (j5.55p to have finite expectation, and hence is finite 
a.s. Therefore G U® a.s. 

To conclude the proof of Theorem ll.40l it only remains to verify the martingale property, 
namely that given <1> G 11^’^ and G U®, for each 0 < s < f, we have a.s. 


(5.63) E $(tP) - 4>(Tf'^) - ^ (L^®$)(T,fS)du (T,^®) 


J0<«<s 


= 0 . 


Consider first the case s > 0. Then a.s. G U®. We can use the Markov property of 
and apply the integral equation (|5.2p for <1> G 11^’^ and initial state Tp®, as established 
in Prop. 15.31 Equation (|5.63p then follows provided that we can apply Fubini’s Theorem 
to interchange the integral with expectation in 


(5.64) 


E 


J S J s 


As in the proof of Prop. 15.31 we can decompose as + Lp®. The aging and 

diffusion part of the generator action on <1> can be uniformly bounded in time and in 
while the resampling part can be bounded as in (15.521) . which is shown to be integrable 
w.r.t. E[Jq •] in (I5.54p . Therefore Fubini can be applied, and (15.631) holds for s > 0 a.s. 

To treat the case s = 0, we take expectation in (15.631) for s > 0, i.e., 


E 


c1>(aP) - cD(A,^«) - 


GS\ 


[\L^^<^){Pd^^)du 

J S 


= 0 , 


and then let s | 0. By the a.s. continuity of in s > 0, we have E[<l>(A'p®)] <1>(A'^®). 

The convergence of 


mathbhE 


p(LCS$)(TCS)du] 


as s 4- 0 follows by dominated convergence, using the same 


argument as that for (j5.64l) . Therefore (j5.63l) also holds for s = 0, which proves the desired 
martingale property. ■ 
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Appendix A. Proofs on marked metric measure spaces 

In this section, we prove Theorems 11.51 and 11.81 derive a relative compactness condition 
for subsets of , and a tightness criterion for laws on . Furthermore we formally treat 
pasting of trees. Our starting point for the first of these points is Remark 1 1.41 that can 
be identified as a subspace of (M^ )^, endowed with the product P-marked Gromov-weak 
topology. 

Proof of Theorem 11.51 As noted in Remark 11.41 we can identify M'^ as a subspace 
of endowed with the product P-marked Gromov-weak topology. Furthermore, 

under this identification, is a closed subspace of (M^)^. It was shown in [DGPlll 
Theorem 2] that M]^, the space of P-mmm spaces with probability measures, equipped 
with the P-marked Gromov-weak topology, is a Polish space. The same conclusion is 
easily seen to hold for Mj . Therefore (Mj is also Polish, which implies that any closed 
subspace, including is also Polish. I 


Proof of Theorem 11.81 For each /c E N U {0, oo}, let 

& := Unmo^n ■= ^ ^ X P^M)}, 

/n\ 

where x P”, M) is the space of bounded continuous real-valued functions on x 

P” that are k times continuously differentiable in the first (g) coordinates, and 

r,/x)) := J ■■■ J 4>{r,v)fi^^{d{x,v)) for each (A, r,/x) E M]^, 

with z ■= ir{xi,Xj))i<i^j<n and v := (vi,, Vn). 

We use once more Remark 11.41 to now identify as a subspace of nfceN^<cfc> 

endowed with the product P-marked Gromov-weak topology. Here denotes the space 
of P-mmm spaces with a measure of total mass at most c, and e.g. Ck ■= ||iAfc||oo&(F) for 
any r > 0 with supp(V'fc) C Br{o). 

By |DGPlll Theorem 5], H^ := UnllJ^ is convergence determining in M]^ and A4 i(M]^), 
and hence also in and A4i(M<^). We now want to argue that this holds as well 
for c This follows immediately from |EK86[ Prop. 3.4.6] for measures on 

product spaces. ■ 

The following relative compactness criterion for subsets of M'^ follow easily from the 
identification of M'^ as a subspace of (Mj )^, and the relative compactness criterion for¬ 
mulated for subsets of in [GPW131 Prop. 6.1], for subsets of M]^ in |DGPlll Thm. 3] 
and for subsets in [ALW 14] Corollary 4.3]. 

Theorem A.l (Relative compactness of subsets of M^). Let F C M^, and let o be any 
point in V. For each A; E N, let Bk{o) denote the open ball of radius k centered at o. Then 
F is relatively compact w.r.t. the V-marked Gromov-weak"^ topology if for each A: E N, 
{{X,r, l{DeSfe(o)}A^(dxdx;)) : (A,r,/x) E F} is a relatively compact subset o/Mj, i.e., 

(i) The family of finite measures on V, 

Afc := {/x(A X dx;)l{^6s^(o)} : {X,r,iJ.) E F}, 
is relatively compact w.r.t. the weak topology; 
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(ii) For each e > 0, there exists L > 0 such that uniformly in {X, r, fa) G P, 

// '^{r{x,y)>L}'^{u,veBk{o)}K<ixdu)fi{dydv) < e; 

J J{Xxvy 

(iii) For each e > 0, there exists M G N such that uniformly in (X, r, ff) G P, we can 

find M halls of radius e in X, say ..., C X with B = such 

that fi{{X\B) X Bk{o)) < e. 

By the tightness criteria formulated for random variables with values in in |GPW13l 

Theorem 3], with values in Mj^ in |DGPlli Theorem 4] and with values in in |ALW 141 

Gorollary 4.6], we obtain the following tightness criteria for M^-valued random variables. 

Theorem A.2 (Tightness of M'^-valued random variables). Let o be any point in P, and 
let Bk{o) denote the open ball of radius k centered at o. A family of M.^-valued random 
variables {T) = (Xj, r*,//*)}*£/ is tight if for each k e'N, {Xj, r*, l{^gB^(o)}/ij(dxdu)}i 6 / is 
a tight family of Mj-valued random variables, i.e., 

(i) ^ du)}ig/ is a tight family of random variables taking values in 
the space of finite measures on V (equipped with the weak topology); 

(ii) {Xi,ri, fj,i{dx x Bfc(o))}jg/ is a tight family of random variables taking values in 
the space of metric measure spaees (equipped with the Gromov-weak topology). 

Using the characterization of relatively compact sets in given in Theorem lA.l] one 
can also formulate more concrete conditions for the tightness of a family of M'^-valued 
random variables, using concrete conditions for the tightness of a family of random metric 
measure spaces formulated in [GPWDOl Thm. 3]. 


Appendix B. The Brownian web 

In this section, we recall the construction and basic properties of the Brownian web. 
For a recent comprehensive survey, see [SSS15| . Recall from Subsection 11.41 the random 
variable (W, W) as constructed in |FINR.n4(IFINR,06j . and in particular from (ll.33p - (ll.35h . 
the state spaces of PI of W and PI of W. It has been shown [FINROd"] Theorem 2.1] that 
the Brownian web W can be characterized as follows: 

Theorem B.l (Gharacterization of the Brownian web). The Brownian web W is a ran¬ 
dom closed subset of II, whose law is uniquely determined by the following properties: 

(i) For every z G almost surely W(z) contains a unique path. 

(ii) For every finite n and deterministic points zi, - ■ ■ ,Zn^ {yV(zi) : i = 1, - ■ ■ , n} 
are distributed as n coalescing Brownian motions starting from zi, - ■ ■ ,Zn. 

(iii) For every deterministic countable dense subset P C W is almost surely the 
closure of {yV{z); z G V} in PI. 

The following result shows that every path in W can be approximated by the countable 
set of paths {W{z)] z G V} in a very strong sense (see e.g. |SS08l Lemma 3.4]). 

Lemma B.2 (Gonvergence of Paths in W). Almost surely, if (/n)neN o-n-d f are paths in 
W, starting respectively at times (sn)neN o,nd s, and fn^fin PI, then sup{t : fn(t) 7 ^ 
f{t)} s as n ^ 00 . 


The dual Brownian web W can be characterized as follows [FINE, 06] Theorem 3.7]: 
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Theorem B.3 (Characterization of the dual Brownian web). Let W be a Brownian web. 
Then there exists an almost surely uniquely determined H-valued random variable W de¬ 
fined on the same probability spaee as W, called the dual Brownian web, such that: 

(i) Almost surely, paths in W and W do not cross, i.e., there exist no f £ W and 
/ G W and s t sueh that {f{s) — fis)){f{t) — f{t)) < 0; 

(ii) TZW has the same law as W, where TZ denotes the refleetion map that maps each 
/ G W to an / G n such that the graph of f in is the reflection of the graph of 
f with respect to the origin. 

Below we collect some basic properties of the Brownain web which we will use. For 
further details, see [FINR041IFINR061ISS08] . The first property concerns the configuration 
of paths in W and W entering and leaving a point z G For each z = {x,t) G 
let moutiz) denote the cardinality of >V(z). We will let min(z) denote the number of 
equivalence classes of paths in W entering z, where a path / G W is said to enter z 
if it starts before time t and f{t) = x, while two paths f,g £ W entering z are called 
equivalent if they coalesce before time t. Note that m\n{z) = 2 if and only if z is a point of 
coalescence between two paths in W. Similarly, we can define min(z) and mo^t{z), based 
on the configuration of paths in W. The pair (min,mout) is called the type of z in W. 

We cite the following result from [TW981 Prop. 2.4] and [FINROGl Thm. 3.11-3.14]. 

Lemma B.4 (Special points for the Brownian web). Let W and W be a Brownian web 
and its dual. Almost surely: 

(1) The set o/z G with (min(z),mout(-7)) = {'<^m{z),rho^t{z)) = (0,1) has full 
Lebesgue measure on M^. 

(2) For each t G M, the set of z = {x',t') G M x {t} with mout(^) > 2 is a countable 
set, with min(z) > 1 for each such z, i.e., z lies on the graph of some path in W 
starting before time t. 

Next we cite a result on the decay of the density of coalescing paths started at time 0. 

Lemma B.5 (Density for the Brownian web). For t > 0, let — {/(t) : / G 

UxeKi^(^)O)} denote the point set on M generated at time t by the collection of coalescing 
paths in the Brownian web W started at time 0. Then for any a < b, 

(B.l) = 

This result can be easily derived by using the duality between W and W, namely that 
^®x{o} (^x, x -\- e) tj) ii and only if the two paths in W starting from (x, t) and (x -|- e, t) 

do not collide on the time interval [0,t]. See e.g. [SS081 Prop. 1 . 12 ], where such a density 
calculation is carried out for a generalization of the Brownian web known as the Brownian 
net, which in addition allows branching of paths. 

Appendix C. Correlation Inequalities for Coalescing Brownian motions 

In this section, we prove some negative correlation inequalities for a collection of coalesc¬ 
ing Brownian motions, which are used in Section [5j Similar inequalities have previously 
been established in [MRTZO^ . see also [NRSOSl Remark 7.5]. Here we deduce more gen¬ 
eral negative correlation inequalities from Reimer’s inequality applied to coalescing random 
walks. 

In van den Berg and Kesten [vdBKO^ . Reimer’s inequality was applied to continuous 
time coalescing random walks with a generalized coalescing rule. Since we are interested 
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in coalescing Brownian motions, discrete space-time coalescing random walks with instan¬ 
taneous coalescing already provide an adequate approximation, and Reimer’s inequality 
can be applied without any complication to the latter. 

First we recall Reimer’s inequality |ReiOOl . For each i G / := {1, • • • , n}, let Si be a finite 
set with a probability measure on Si. Let Q = Si x S 2 ■ ■ ■ x Sn and /r = /ri x • • • x 
For K C I and u = {u}i)i^j, define the cylinder set C{K,u) := {oj' G 0 : w' = w* V i G K}. 
Given two events A, B G we say A and B occur disjointly for a configuration cj G n 
if there exists K C I such that C{K,uj) C A and C{I\K,oj) C B. The set of cu G n 
for which A and B occur disjointly, which we call the disjoint intersection of A and B, is 
denoted by 

AOB :={uj£n:3K Cl s.t. C(K,uj) C A and C{I\K,uj) C B}. 

Then Reimer’s inequality asserts that, for any two events A,B C n, 

(C.l) fi{ADB) < fi{A)n{B). 

Now we apply this inequality to coalescing random walks. We recall first the con¬ 
struction of discrete space-time coalescing random walks. Let = {{x,t) G : 

X + t is even}. Let be i.i.d. random variables taking values in {±1}. A di¬ 

rected edge is drawn from each z = {x,t) G Zg^g^, which ends at (x -|- l,f -|- 1) if iOz = 1, 
and ends at (x — 1, t -|-1) if = —1. This provides a graphical construction of a collection 
of coalescing random walks, where the random walk path starting from each z G Zg^g^ is 
constructed by following the directed edges in Zg^g^^ drawn according to to. 

To illustrate how Reimer’s inequality is applied, let , 1 < f < n, be a 

collection of coalescing random walks constructed as above with starting points {xi,ti) G 
Zg^en, and assume for simplicity < 0 for all 1 < z < n. For f G N, let = {x G 
Z : X = for some 1 < z < nj. Let Oi,--- ,Ok be disjoint subsets of Z, and 

let Ai = {uj : 13 Oi 0}. The crucial observation is that, if the events (Aj)i<j<„ 

occur simultaneously, then they must occur disjointly w.r.t. because of the 

coalescence. Namely, 


k 

(C.2) f| A, = AiDAa • • • DAfc. 

i=l 

Reimer’s inequality (jC.ip then gives the negative correlation inequality 

k k 

(C.3) P(f|A,) 

i=l i=l 

The same reasoning allows us to choose each Aj to be an increasing event of the occupation 
configuration i.e., given uj and uj' with respective occupation configurations C 

n Oj, if a; G Aj, then also oj' G Ai. Note that Reimer’s inequality may even be applied 
to the disjoint occurence of the event Ai with itself, which we use later in the proof of 
Lemma 1C.61 

Using Reimer’s inequality as illustrated above, together with the invariance principle 
for coalescing random walks, we will deduce a host of negative correlation inequalities for 
coalescing Brownian motions, which we formulate next. 

Definition C.l (Negatively correlated point processes). We say a simple point process ^ 
on M is neqatively correlated, if for any n G N and any disjoint open intervals Oi, • • • , On, 
we have P(nt n / 0}) < nr=i n O, ^ 0). 
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Lemma C.2 (Negative correlation for colaescing Brownian motions). Let A C K x 
(—00,0], and let denote the point set on M generated at time t > 0 by the collec¬ 
tion of coalescing Brownian motions in the Brownian web W starting from A. Then 
is negatively correlated. 

Proof. By monotone convergence, it suffices to consider the case when A consists of a 
finite number of points {zi, • • • , Zk}- The fact that is negatively correlated then follows 
directly from the negative correlation inequality (IC.3h for coalescing random walks, and 
the distributional convergence of coalescing random walks to coalescing Brownian motions 
in the local uniform topology (see e.g. |NRS05l Section 5]). ■ 

Lemma C.3 (Decoupling of correlation functions). Let be as in Lemma IC.H Let 
CLi < bi < 02 < b 2 ■■■< On < bn- Then for any 1 < j < n — 1, 

(C.4) 

n 

1P( ^ ^ ^ ^ = 0}) 

i=l 

< n {aj,bj) + 0,|i^ n {bj,aj+i) = n {aj+i,bj+i) / 0^ n {ai,bi) 0). 

l<i<n 

Proof. As before, this follows from approximation by discrete space-time coalescing 
random walks and Reimer’s inequality. Note that for the discrete analogue of the events 
in the second line of (|C.4p . if they all occur, then they must occur disjointly. ■ 

Lemma C.4 (Negative correlation for occupation number). Let be as in Lemma \C.Si 
and let B gM. have finite Lebesgue measure. Then for any /c G N, 

(C.5) P(|^ 4 ^ nB\>k)< P(|^i^ n R| > 1)^ 

Proof. By monotone convergence and approximation by open sets, it suffices to consider 
the case when A C M x (—oo,0] consists of a finite number of points, and B is the finite 
union of disjoint open intervals. In fact, the argument is the same if R is a bounded open 
interval, say ( 0 , 1 ). 

We proceed by discrete approximation. Let be the subset of Z occupied at 

time t G N by a collection of coalescing random walks on Zg^g^ starting from zi = 
{xi,ti),-" = {xn,tn) £ '^even with < 0 for all 1 < z < n. Given O C Z and 

for A; G N, let Afc = {cu G D : |Zi n 0| > k}, where u = the i.i.d. {±1}- 

valued random variables underlying the graphical construction of the coalescing random 
walks. We note that 

k 

(C. 6 ) Afc c AiD-^-DAi. 

Indeed, if A^. occurs, then we can find k disjoint random walk paths, each of which occupies 
a distinct site in O at time t. Reimer’s inequality (|C.ip then implies P(Afc) < P(Ai)^. 
Inequality (IC.5p then follows by the distributional convergence of coalescing random walks 
to coalescing Brownian motions in the local uniform topology. I 


Lemma C.5 (Moment bounds for occupation number). Let be a simple point process on 
M with a locally finite intensity measure pL, which is absolutely continuous w.r.t. Lebesgue 
measure on M. Iff is negatively correlated, then for any Lebesgue measurable R C M with 
p{B) < oo, and for any k gN, we have 


E 


lenRl"] < ^ 


m=l 


m 


k—m 


p{By 


(C.7) 


m 
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Proof. Let B be an open interval, say (0,1). For n E N, let Dn = {i2~"' : 0 < i < 2”}, 
and let D = By our assumption that the intensity measure /i is absolutely 

continuous w.r.t. Lebesgue measure, ^ H -D = 0 almost surely. For 1 < i < 2”, let 
= l{^n((i-i) 2 -'i,i 2 -'i)^ 0 }- By the assumption that ^ is a simple point process, and 
by monotone convergence, 

(C.8) 


E[|^n(0,l)|^] =E 


lim 

1<2<2 




= lim E 

n^oo 


[(E'F)' 


= lim 


l<i<2" 


>oo 

l<ii 


E E 


K 

[n 

i=i 


in) 


Note that /r(0,1) = E[|^ n (0,1)|] = lim^^oo E[/j”^]. Given i := (ii,--- ,ik) E 
{1, • • • , 2"’}*^, let m{i) = |{ii, • • • , ik}\, and denote the m{i) distinct indices in {ii, • • • , i^} 
by (Ti(z) < • • • < am{i)- Then by the negative correlation assumption, 

(C.9) 


E E 

,4<2" 


n k An) 

h 


< A<2. nri'i = EEi Ei<„< 0” i 

< EEi ( ELi EIT’I)” Et=i m mMO, !)"•, 

where f{k,m)m\ = , 4 ) E {I,-- - ,2"'}*' : {n,-" , 4 } = Wir" which is 

easily seen to be independent of the choice of 1 < ui < • • • < am < 2”. By hrst picking 
m indices out of {ii, • • • , i^} and assign them values ai < ■ ■ ■ < am respectively, we easily 
verify that f{k, m) < which proves (IC.7p for B = (0,1). By the same argument, 

()C.7p holds for hnite unions of disjoint open intervals, and by monotone convergence, for 
open sets as well. Since any Lebesgue measurable set can be approximated from outside 
by open sets, again by monotone convergence and the fact ^CiE = (/) a.s. for a given E C M 
with zero Lebesgue measure, (IC.7p also holds for any Lebesgue measurable B. I 


We also need the following estimate on the constrained two-point correlation function 
for the Brownian web. 

Mx -f Ol 

Lemma C.6 (Constrained two point function for the Brownian web). Let ^ ‘ be as 
in Lemma W. 51 Let t > 0. For a < b, let I[a,b] denote the event that n [a,b] ^ 0. 

Then for any xi < X 2 with A := X 2 — xi, we have 

. -Ai 

1 r n Ae 4 t 

(C.IO) 3^2) ■— (^) ~ ^ E n/[a, 2 ^a; 2 +< 5 ]] ~ 3 ■ ^ 

Proof. By translation invariance of may assume xi = 0, and let X 2 — xi = 

A > 0. Let F’(o,t)j ^((5,b’ ^(A,L be the dual coalescing Brownian motions 

in W starting at respectively (0,t), (<5, t), (A,t) and (A -|- 6,t). Let 4,6 be the time of 
coalescence between W(^a,t) and Then by the duality between W and W, almost 

surely, the event H I[s^a] occurs if and only if tq^s < 0 and 4 a < 0, and the event 
-^[o,<5] hi I[5,A+S] occurs if and only if 4,5 < 0 and 4 a+5 < 0. Since 4 ^a] C I[s,a+S] and 
d[5,A+S]\d[S,A] = -^[6,A] h d(A,A+S]^ 

, . n I[xi+S,X2] h d[x2,X2+S^ 

— ^0^5,A (F3,5 > t,T^^A+& > t) Eq ,5^a+5 v0,S > tiT^S,A ^ t)i 
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where we have reversed time and replaced n)(a,t) for a = 0 , (5, A, A + 5 by independent 
standard Brownian motions Bi,B 2 and B^ starting from 0, 5, and A (or A + 5), and Ta,b 
is the first hitting time between the two Brownian motions starting from a and b. 

By the Karlin-McGregor formula [KM59] , the transition density for three one-dimensional 
Brownian motions and B^ starting from xi < X 2 < X 3 to end at locations yi < 

2/2 < 2/3 at time t without ever intersecting along the way is given by the determinant 

Det{pt{xi,yj)i<ij< 3 ), where pt{x,y) = ^ ^ . Therefore we define 


(C.12) D := > t, ts,a+s > t) - > t, tsa > t). 


The r.h.s. can be written as: 


f f f 

Pt{o,yi) 

Pf (0,2/2) 

Pi (0,^3) 


Pi(0,yi) 

Pi( 0,P2) 

Pi (0,2/3) 

= /// 

pt{d, yi) 

Pi (5, P2) 

Pi (5,^3) 

- 

Pi (5, yi) 

Pi ( 5,2/2) 

Pi (5,2/3) 

J J J 

yi<y 2 <y 3 

Pt{A + 5,2/1) 

Pt{A + 5 , 2/2) 

Pi(A + 5,2/3) 


Pi(A, 2 /i) 

Pi(A, 2/2) 

Pi(A+, 2/3) 


After some elementary manipulation, we obtain that D is given by the expression: 
(C.13) 


D = 



yj+y^+yj+S^+A^ 


yi<y2<y3 


(27r/)l 


1 

e t — 1 

e~^fit,yi,6,A) 


1 

iy2 

e t — 1 

f{t,y 2 ,S,A) 


1 


Sy3 

e ‘ 


- 1 


Ay 


e * /(t, 2 / 3 , <5, A) 


dy, 


2(yi-A)S-S^ 

where f{t,yi,6,A) = e 2 t 

\\y\\^ 

factor e 2 * allows us to take D5 


-2 


1. We then Taylor expand in 6, and note that the 
and pass the limit 5 | 0 inside the integral to obtain 


(C.14) 


D 

hm--^ 



yj+y'^+yl+A^ 

1 

1 

1 

- [[[ 

2t 

yi 

Ayi 

e~{yi - A) 

2/2 

Ayn 

e~ ( 2/2 - A) 

ys 

e^(2/3-A) 

JJJ 

yi<y2<y3 

(27r)t/i 


dy. 


Expanding the determinant and performing the change of variable yi = xiy/t and A = Ay/t 
gives Kf = ((27r)it)“\/i +I 2 + I 3 ), where 


(C.15) 


h = 


(xi —A)^+x‘^-\-x‘^ 


(xi - A)(x3 - X2)dx, 


Xi<X2<X3 


h = 



x-]-l-(a ;2 — 2 A) +31-^ _ 

e 2 ( 0:2 - A)(a:i - X 3 )dx, 


X1<X2<X3 


h = 


x^+x^ + (x 3 —A)^ ^ ^ ^ 

e 2 (x 3 — A)(x2 — xi)dx. 


We then have 
(C.16) 

/i + /3 = - 

X2<X3 


Xl<X2<X3 


X 2 + {x 2 -A '^+^3 


2 (X 3 - X2)dX2dX3 JJ 

Xl<X2 


x^+x^-f (x 2 —A)^ 


2 (0:2 — xi)dxidx2 




^+x^ + (x: 2 -A)^ 


, . TT - a 2 

2 (x2 — xi)dxidx2 = —j=Ae 4 , 

V 2 
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and 

r r , (a;i-A)2 (a;3-A)2 , 

h= e '2 [e 2 — e 2 ){xi — xsjdxidxs 

(C.17) 

Xl<X3 

f f xl + (xi-A)^+xl , , vr ^ A2 

= yy e 2 — xsjdxidx^, = 4 . 

R2 

a2 


Together, they give K? = ^ . ■ 
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